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DNA ADENINE METHYLTRANSFERASES 
AND USES THEREOF 

GOVERNMENT RIGHTS 

The research that led to this application was supported in part by an 
5 NIH grant, and the government may have certain rights to the invention. 

BACKGROUND OF THE INVENTION 

A. Field of the invention 

This invention pertains to the field of microbiology and to the treatment 
of conditions caused by microbes. In particular, this invention pertains to the 
10 isolation, sequencing, and detection of a DNA adenine methyltransferase gene from a 

variety of micro-organisms. ^ 

B. Related Art 

Most organisms modify their genomic DNA by the methylation of 
specific nucleotide bases. DNA methylation is critical to gene regulation and repair 

15 of mutational lesions (for recent reviews see Jost and Saluz, DNA Methylation, 

Molecular Biology and Biological Significance. Birhauser Verlag, Basel, Switzerland 
(1993); Palmer and Marinus, Gene 143:1-12 (1994)). 

DNA methylation is catalyzed by a class of enzymes of varying 
substrate specificity called DNA methyltransferase enzymes. A DNA 

20 methyltransferase from the bacterium Caulobacter crescentus, cell cycle regulated 

methyltransferase ("CcrM" refers to the protein and '*ccrM" denotes the gene), 
methylates the adenine residue in the recognition sequence GANTC (Zweiger e( a/., 
A Mo/. BioL 235: 472-485, 1994; N denotes any nucleotide). CcrM is unusual, as it 
is not part of a restriction modification system, and is the only known prokaryotic 

25 DNA methyltransferase shown to be essential for viability (Stephens et a/., Proc. NatL 

Acad. Sci. 93:1210-1214, 1996) outside of a restriction modification system {i.e., a 
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coexpressed methylase and restriction enzyme which recognize a same nucleotide 
sequence). 

The CcrM protein, and therefore its DNA methylation activity, is 
present only at the pr^livisional stage of the cell cycle (Zweiger et a/., /. Mo/. Biol. 
235: 472-485, 1994; Stephens et a/., Proc. Natl. Acad. ScL 93:1210-1214, 1996). 
This is controlled in two ways; the ccrM gene is transcribed only in the predivisional 
cell (Stephens et a/., /. Bacteriol. 177:1662-1669, 1995) and the CcrM protein is 
highly unstable and is completely degraded by the time of cell division in a Lon 
protease dependent process (Wright et a/.. Genes and Development 10:1532-1542, 
1996). 

SUMMARY OF THE INVENTION 

The present invention comprises the isolation and sequence of a 
number of methyltransferase-encoding nucleic acids and their gene products, 
including the methyltransferase gene from Rhizobium meiiloti, Brucella abortus, 
Agrobacterium tumefaciens, and Helicobacter pylori. These novel DNA 
methyltransferases are potential targets for new antimicrobial agents. Under the assay 
conditions provided herein, these enzymes exhibit a novel property called 
processivity. 

In one series of embodiments, the invention comprises an isolated 
nucleic acid that encodes a Rhizobium meiiloti DNA methyltransferase, including a 
nucleic acid having SEQ ID NO:l; cells that contain and express such nucleic acids; 
and isolated DNA adenine methyltransferases encoded by such a nucleic acid (e.g., 
SEQ ID NO: 2). 

In another series of embodiments, the invention comprises an isolated 
nucleic acid that encodes a erace//a abortus DNA methyltransferase (e.g., SEQ ID 
NO:4), particularly a nucleic acid having SEQ ID NO:3; cells that contain and 
express such nucleic acids, and isolated DNA adenine methyltransferases encoded by 

such nucleic acid. 

In another series of embodiments, the invention comprises an isolated 
nucleic acid (e.g., SEQ ID NO: 5) that encodes a partial sequence of Agrobacterium 
tumefaciens DNA methyltransferase (e.g., SEQ ID NO: 6). 
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In another series of embodiments, the invention comprises an isolated 
nucleic acid (e.g., SBQ ID NO: 7) that encodes a Helicobacter pylori DNA 
methyltransferase (e.g., SEQ ID NO: 8); cells that contain and express such nucleic 
acids, and isolated DMA adenine methyltransferases encoded by such nucleic acid. 
5 The ccrM genes for Rhizobium meliloti, Agrobacterium tumefaciens 

and Brucella abortus exhibit homology to Caulobacter ccrM. It is highly likely that 
the ccrM homologs are a new DNA methyltransferase family which is not part of a 
restriction modification system. 

Both Caulobacter and Rhizobium ccrM are.essential for viability. 
10 Neither gene can be disrupted from the chromosome unless a copy is provided in 

trans on a plasmid (Stephens et a/., Proc. NaCL Acad. Sci. 93:1210-1214, 1996; this 
application). The overexpression of both Rhizobium and Caulobacter ccrM results in 
defects in cell morphology and cell division, demonstrating the importance of DNA 
methylation in these two bacteria. Hemimethylated DNA could be detected in both 
J 5 Rhizobium and Caulobacter. In the case of Caulobacter this is due to the cell cycle 

regulation of ccrM. 

In another embodiment, this invention provides for vectors 
incorporating any of the above-described nucleic acids. The vectors preferably 
include the above-described nucleic acid operably linked to (under the control oO a 
20 promoter, either constitutive or inducible. The vector can also include an initiation 

and a termination codon. 

In another embodiment, this invention provides for cells that contain 
the above-mentioned nucleic acids and cells that express the above-mentioned 
nucleic acids that encode adenine methyltransferases. For example, host cells may 
25 be transfected with a nucleic acid of SEQ ID NO: 1, 3, 5, or 7. 

In addition to providing for host cells stably transfected with nucleic 
acids encoding adenine methyltransferases, this invention also uses these transfected 
host cells to detect compounds that are capable of inhibiting adenine 
methyltransferase. 

30 The invention further provides for nucleic acid probes that are capable 

of selectively hybridizing to a nucleic acid encoding an adenine methyltransferase. 
For example, the nucleic acid probe can be the nucleic acid of SEQ ID NO: 1, 3, 5, 
or 7. These probes can be used to measure or detect nucleic acids encoding adenine 
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methyltransferases. The probes are incubated with a biological sample to form a 
hybrid of the probe with complementary nucleic acid sequences present in the 
sample. The extent of hybridization of the probe to these complementary nucleic 
acid sequences is then determined. 

In another embodiment, this invention provides for antibodies to the 
methyltransferases encoded by the above-mentioned nucleic acids. Particularly 
preferred antibodies specifically bind a polypeptide comprising at least 10, more 
preferably at least 20, 40, 50, and most preferably at least 100, 200, and even 300 
contiguous amino acids, or even the full length polypeptide encoded by a nucleic 
acid selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID 
NO: 5, or SEQ ID NO: 7; wherein said polypeptide elicits the production of an 
antiserum or antibody which specifically binds to a polypeptide selected from the 
group consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ. ID NO: 6, or SEQ ID NO: 
8, wherein the antiserum or antibody preferably does not cross-react with the C. 
crescentus adenine methyltransferase. The antibody can be polyclonal or 
monoclonal. The antibody can also be humanized or human. 

This invention also provides for cells (e.g., recombinant cells such as 
hybridomas or triomas) which synthesize any of the above-described antibodies. 

This invention also provides for kits for the detection and/or 
quantification of the above-mentioned nucleic acids. The kit can include a container 
containing one or more of any of the above identified nucleic acids, amplification 
primers, and antibodies with or without labels, free, or bound to a solid support as 
described herein. The kits can also include instructions for the use of one or more of 
these reagents in any of the assays described herein. 

This invention further provides for methods and assays for identification 
and screening for novel antibiotics that target the methyltransfecases of this invention. 
Such assays include those for screening for inhibitors of DNA methyltransferase 
activity that comprises:i. contacting in an aqueous reaction mixture a nucleic acid 
encoding a DNA methyltransferase wherein said methyltransferase has a molecular 
weight of about 30-45 kilodaltons and binds to a polyclonal antibody that specifically 
binds to a polypeptide from the group of polypeptides having SEQ ID NO:2, SEQ ID 
NO:4, SEQ ID NO:6, and SEQ ID NO:8 with an antisense agent that inhibits the 
expression of the methyltransferase; and ii. detecting the level of inhibition relative 
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to a control reaction mixture that is substantially identical to the reaction mixture of 
step i except that the antisense agent is not present in an amount effective to inhibit 
the expression of the methyltransferase. The methods include both in vivo and in 
vitro methods. The antisense agents can either be added exogenously or are 
produced endogenously through conventional recombinant gene methods. 

Other methods for screening include methods for assaying for inhibitors 
of DNA methyltransferase activity comprising the steps of: i. contacting an aqueous 
reaction mixture containing a DNA methyltransferase wherein said methyltransferase 
has a molecular weight of about 30-45 kilodaltons and binds to a polyclonal 
antibody that specifically binds to a polypeptide from the group of polypeptides 
having SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and SEQ ID NO:8 with an agent 
that inhibits the biological activity of the methyltransferase; and, ii. detecting the 
level of inhibition relative to a control reaction mixture that is substantially identical 
to the reaction mixture of step i except that the inhibitory agent is not present in an 
amount effective to inhibit the expression of the methyltransferase. The DNA 
methyltransferase is not contained within a living cell or the assay can be an in vivo 
assay where the enzyme is inhibited within a living cell. 

Processive assays are also described herein such as an assay for 
detecting antibiotics that target processive adenine methyitransferases, comprising: 
i) contacting a methyltransferase with a methyltransferase substrate in the presence 
and absence of a test substance; and b) detecting the enzymatic activity of the 
methyltransferase in the presence and absence of the test substance. 

Finally, this invention also provides therapeutic methods. These 
include methods of detecting infections with Brucella spp. and H. pylori by detecting 
the presence or absence of specific sequences of Brucella or H. pylori adenine 
' methyitransferases or by detecting the proteins themselves using^antibodies. Other 
methods include treating conditions caused by Agrobacterium spp., Rhizobium spp, 
and Helicobacter spp. Other methods involve administering to a mammal a 
therapeutically effective dose of a composition comprising a methyl transferase 
inhibitor and a pharmacological excipient. For animal associated bacteria, methods 
are preferably performed on mammals such as mice, rats, rabbits, sheep, goats, pigs, 
more preferably on primates including human patients. Of course for plant 
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associated bacteria such as Agrobacterium and Rhizobium spp., the preferred 
methods are performed on their respective host plants. 

BRIEF DESCRIPTION OF THE SEQUENCES 
Figure 1 is a sequence of a nucleic acid that encodes a Rhizobium 
meliloti DNA methyltransferase (SEQ ID NO:1). The start codon is boxed and the 

stop codon is circled. 

Figure 2 is the peptide sequence of a Rhizobium meliloti DNA 

methyltransferase (SEQ ID NO:2). 

Figure 3 is a sequence of a nucleic acid that encodes a Brucella abortus 
DNA methyltransferase (SEQ ID NO:3). The start codon is boxed and the stop 

codon is circled. 

Figure 4 is a peptide sequence of a Brucella abortus DNA 
methyltransferase (SEQ ID NO:4). 

Figure 5 is a partial sequence of a nucleic acid that encodes an 
Agrobacterium tumefaciens DNA methyltransferase (SEQ ID NO:5). 

Figure 6 is a partial peptide sequence of an Agrobacterium tumefaciens 
DNA methyltransferase (SEQ ID NO:6). 

Figure 7 is a complete sequence of a nucleic acid that encodes a 
Helicobacter pylori DNA methyltransferase (SEQ ID NO: 7). 

Figure 8 is a complete peptide sequence of a Helicobacter pylori DNA 
methyltransferase (SEQ ID NO:8). 

UST OF TABLES 

Table 1 is a comparison of the sequences of Caulobacter crescentus 
("Ccr"), Rhizobium meliloti ("Rme"), Agrobacterium tumefaciensA" Atu"), Brucella 
abortus ("Bab"), and Helicobaaer pylori ("Hpy") DNA adenine methyltransferases. 
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DETAILED DESCRIPTION OF THE INVENTION 

A. Definitions 

The term "biological activity" in the context of DNA methyltransferase 
refers to the capacity of the enzyme to act as a methyltransferase as defined herein. 

The term "methyltransferase" denotes an enzyme that transfers a methyl 
group from a methyl donor to a specific site on a nucleic acid substrate, wherein the 
specific site is preferably a specific base in a characteristic sequence present in the 
nucleic acid substrate. 

The term "processive" methyltransferase signifies that, under the assay 
conditions used, whenever there is more than one potential methylation site on a 
DNA substrate, after methylating a first site the methyltransferase methyiates the 
second or subsequent sites without dissociating from the DNA substrate. 

The term "DNA-dependent" signifies that the methyltransferase tends to 
lose activity in solution in the absence of a DNA substrate. 

The term "nucleic acid" refers to deoxyribonucieotides or 
ribonucleotides and polymers thereof in either single- or double-stranded form. 
Unless specifically limited, the term encompasses nucleic acids containing known 
analogues of natural nucleotides which have similar binding properties as the 
reference nucleic acid and are metabolized in a manner similar to naturally occurring 
nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also 
implicitly encompasses conservatively modified variants thereof (e.g. degenerate 
codon substitutions) and complementary sequences and as well as the sequence 
explicitly indicated. Specifically, degenerate codon substitutions may be achieved by 
generating sequences in which the third position of one or more selected (or all) 
codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et a/.. 
Nucleic Acid Res. 7 9:5081 (1991); Ohtsuka et a/., /. BioL Chenu 260:2605-2608 
(1985); and Cassol et a/., 1992; Rossolini et a/., Mol. Cell, Probes 5:91-98 (1994)). 
The term nucleic acid is used interchangeably with gene, cDNA, and mRNA 
encoded by a gene. 

The phrase "exogenous" or "heterologous nucleic acid" generally 
denotes a nucleic acid that has been isolated, cloned and 11 gated to a nucleic acid 
with which it is not combined in nature, and/or introduced into and/or expressed in 
a cell or cellular environment other than the cell or cellular environment in which 
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said nucleic acid or protein may typically be found in nature. The term encompasses 
both nudeic acids originally obtained from a different organism or cell type than the 
cell type in which it is expressed, and also nucleic acids that are obtained from the 
same cell line as the cell line in which it is expressed. 

"Nucleic acid probes" may be DNA or RNA fragments. DNA 
fragments can be prepared, for example, by digesting plasmid DNA, or by use of 
PGR, or synthesized by either the phosphoramidite method described by Beaucage 
and Carruthers, Tetrahedron Lett. 22:1859-1862 (1981), or by the triester method 
according to Matteucci, et a/., I. Am. Chem. Soc, 103:3185 (1981), both 
incorporated herein by reference. A double stranded fragment may then be 
obtained, if desired, by annealing the chemically synthesized single strands together 
under appropriate conditions or by synthesizing the complementar/ strand using 
DNA polymerase with an appropriate primer sequence. Where a specific sequence 
for a nucleic acid probe is given, it is understood that the complementary strand is 
also identified and included. The complementary strand will work equally well in 
situations where the target is a double-stranded nucleic acid. 

The phrase "selectively hybridizing to" refers to a nucleic acid probe 
that hybridizes, duplexes or binds only to a particular target DNA or RNA sequence 
when the target sequences are present in a preparation of total cellular DNA or RNA. 
"Complementary" or "target" nucleic acid sequences refer to those nucleic add 
sequences which selectively hybridize to a nucleic acid probe. Proper annealing 
conditions depend, for example, upon a probe's length, base composition, and the 
number of mismatches and their position on the probe, and must often be 
determined empirically. For discussions of nucleic acid probe design and annealing 
conditions, see, for example, Sambrook et a/.. Molecular Oor^ing: A Laboratory 
Manual (2nd ed.). Vols. 1-3, Cold Spring Harbor Laboratory (I9fl9), or Current 
Protocols in Molecular Biology, F. Ausubel et a/., ed. Greene Publishing and Wiley- 

Interscience, New York (1987). 

The phrase "a nucleic acid sequence encoding" refers to a nucleic acid 
which contains sequence information for a structural RNA such as rRNA, a tRNA, or 
the primary amino acid sequence of a spedfic protein or peptide, or a binding site 
for a trans-acting regulatory agent. This phrase specifically encompasses degenerate 
codons (i.e., different codons which encode a single amino acid) of the native 
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sequence or sequences which may be introduced to conform with codon preference 
in a specific host cell. 

"Nucleic acid probes" may be DNA or RNA fragments. DNA 
fragments can be prepared, for example, by digesting plasmid DNA, or by use of 
PGR, or synthesized by either the phosphoramidite method described by Beaucage 
and Carruthers, Tetrahedron Lett. 22:1859-1862 (1981), or by the triester method 
according to Matteucci, et a/., /. Am. Chem, Soc, 103:3185 (1981), both 
incorporated herein by reference. A double stranded fragment may then be 
obtained, if desired, by annealing the chemically synthesized single strands together 
under appropriate conditions or by synthesizing the complementary strand using 
DNA polymerase with an appropriate primer sequence. Where a specific sequence 
for a nucleic acid probe is given, it is understood that the complementary strand is 
also identified and included. The complementary strand will work equally well in 
situations where the target is a double-stranded nucleic acid. 

The phrase "selectively hybridizing to" refers to a nucleic acid probe 
that hybridizes, duplexes or binds only to a particular target DNA or RNA sequence 
when the target sequences are present in a preparation of total cellular DNA or RNA. 
"Complementary" or "target" nucleic acid sequences refer to those nucleic acid 
sequences which selectively hybridize to a nucleic acid probe. Proper annealing 
conditions depend, for example, upon a probe's length, base composition, and the 
number of mismatches and their position on the probe, and must often be 
determined empirically. For discussions of nucleic acid probe design and annealing 
conditions, see, for example, Sambrook et a/.. Molecular Cloning: A Laboratory 
Manual (2nd ed.). Vols. 1-3, Cold Spring Harbor Laboratory (1989), or Current 
Protocols in Molecular Biology, F. Ausubel et a/., ed. Greene Publishing and Wiley- 
Interscience, New York (1987), 

The term "isolated", when applied to a nucleic acid or protein, denotes 
that the nucleic acid or protein is essentially free of other cellular components with 
which it is associated in the natural state. It is preferably in a homogeneous state 
although it'can be in either a dry or aqueous solution. Purity and homogeneity are 
typically determined using analytical chemistry techniques such as polyacrylamide 
gel electrophoresis or high performance liquid chromatography. A protein which is 
the predominant species present in a preparation is substantially purified. In 
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particular, an isolated DNA methyltransferase gene is separated from open reading 
frames which naturally flank the gene and encode a protein other than 
methyltransferase. The term "purified" denotes that a nucleic acid or protein gives 
rise to essentially one band in an electrophoretic gel. Particularly, it means that the 
nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and 
most preferably at least 99% pure. 

The term "recombinant" or "engineered" when used with reference to a 
nucleic acid or a protein generally denotes that the composition or primary sequence 
of said nucleic acid or protein has been altered from the naturally occurring 
sequence using experimental manipulations well known to those skilled in the art. It 
may also denote that a nucleic acid or protein has been isolated and cloned into a 
vector or a nucleic acid that has been introduced into or expressed in a cell or 
cellular environment, particularly in a cell or cellular environment other than the cell 
or cellular environment in which said nucleic acid or protein may be found in 
nature. 

The term "recombinant" or "engineered" when used with reference to a 
cell indicates that the cell replicates or expresses a nucleic acid, or produces a 
peptide or protein encoded by a nucleic acid, whose origin is exogenous to the cell. 
Recombinant cells can express nucleic acids that are not found within the native 
(nonrecombinant) form of the cell. Recombinant cells can also express nucleic acids 
found in the native form of the cell wherein the nucleic acids are re-introduced into 

the cell by artificial means. 

The following terms are used to describe the sequence relationships 
between two or more nucleic acids or polynucleotides: "reference sequence", 
"comparison window", "sequence identity", "percentage of sequence identity", and 
"substantial identity". A "reference sequence" is a defined sequence used as a basis 
for a sequence comparison; a reference sequence may be a subset of a larger 
sequence, for example, as a segment of a full-length cDNA or gene sequence given 
in a sequence listing, such as the nucleic acid sequence of SEQ ID NO: 1, 3, 5, or 7, 
or may comprise a complete cDNA or gene sequence. 

Optimal alignment of sequences for aligning a comparison window 
may be conducted by the local homology algorithm of Smith and Waterman (1981) 
Adv. Appl. Math. 2:482, by the homology alignment algorithm of Needleman and 
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Wunsch (1970) /. Mol. BioL 48:443, by the search for similarity method of Pearson 
and Lipman (1988) Proc, NatL Acad. Sci. (USA) 85:2444, or by computerized 
implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the 
Wisconsin Genetics Software Package Release 7.0, Genetics Computer Group, 575 
Science Dr., Madison, Wl). 

The terms "substantial identity" or "substantial sequence identity", as 
applied to nucleic acid sequences and as used herein, denote a characteristic of a 
polynucleotide sequence, wherein the polynucleotide comprises a sequence that has 
at least 85 percent sequence identity, preferably at least 90 to 95 percent sequence 
identity, and more preferably at least 99 percent sequence identity as compared to a 
reference sequence over a comparison window of at least 20 nucleotide positions, 
frequently over a window of at least 25-50 nucleotides, wherein the percentage of 
sequence identity is calculated by comparing the reference sequence to the 
polynucleotide sequence which may include deletions or additions which total 20 
percent or less of the reference sequence over the window of comparison. The 
reference sequence may be a subset of a larger sequence. 

As applied to polypeptides, the terms "substantial identity" or 
"substantial sequence identity" mean that two peptide sequences, when optimally 
aligned, such as by the programs GAP or BESTFIT using default gap weights, share at 
least 70 percent sequence identity, preferably at least 80 percent sequence identity, 
more preferably at least 90 percent sequence identity, and most preferably at least 95 
percent amino acid identity or more. "Percentage amino acid identity'* or 
"percentage amino acid sequence identity" refers to a comparison of the amino acids 
of two polypeptides which, when optimally aligned, have approximately the 
designated percentage of the same amino acids. For example, "957o amino acid 
identity" refers to a comparison of the amino acids of two polypeptides which when 
optimally aligned have 957o amino acid identity. Preferably, residue positions which 
are not identical differ by conservative amino acid substitutions. For example, the 
substitution of amino acids having similar chemical properties such as charge or 
polarity are not likely to effect the properties of a protein. Examples include 
glutamine for asparagine or glutamic acid for aspartic acid. 

The term "substantially identical" in the context of two reaction 
mixtures refers to reaction mixtures that are considered by those of skill to be 
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sufficiently similar that scientifically valid comparisons can be made between them 
so as to compare relative aaivity due to the presence or absence of an inhibitor 
molecule. 

A cell has been "transformed" by an exogenous nucleic acid when such 
exogenous nucleic acid has been introduced inside the cell membrane. Exogenous 
DNA may or may not be integrated (covalently linked) into chromosomal DNA 
making up the genome of the ceil. The exogenous DNA may be maintained on an 
episomal element, such as a plasmid. A stably transfomned or transfected eukaryotic 
cell is generally one in which the exogenous DNA has become integrated into the 
chromosome so that it is inherited by daughter cells through chromosome 
replication, or one which includes stably maintained extrachromosomal plasmids. 
This stability is demonstrated by the ability of the eukaryotic cell to establish cell 
lines or clones comprised of a population of daughter cells containing the exogenous 
DNA. 

"Adenine methy [transferase substrate" refers to a nucleic acid that is 
acted upon by a DNA methyltransferase to undergo a methylation at an adenine 
residue. The optimum substrate contains at least one GANTC site and is preferably 
of a length that promotes ease of manipulation and yields easily resolvable 
methylation and/or restriction products, preferably a 45 base pair or longer 

oligonucleotide or plasmid. 

The phrase "an essential adenine DNA methyltransferase" indicates 
that, in the absence of this enzyme activity at the appropriate stage in the cell cycle, 
organisms that normally express adenine DNA methyltransferase at that stage will 
die. Enzyme activity may be impaired by a mutation in the enzyme, by the use of 
antisense nucleic acid, by intracellular proteolysis of the enzyme, or by the 
administration of an inhibitor of the enzyme. 

"Restriction" denotes the action of hydrolyzing a single or double 
stranded nucleic acid at a specific sequence or site. "Restriction enzyme" is a 
nuclease that recognizes a specific sequence or site of a nucleic acid, and cleaves the 
nucleic acid at that site. "Restriction site" is the particular sequence or site 
recognized and hydrolyzed by a restriction enzyme. 

The phrase "specifically binds to an antibody" or "specifically 
immunoreactive with", when referring to a protein or peptide, refers to a binding 
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reaction which is determinative of the presence of the protein in the presence of a 
heterogeneous population of proteins and other biologies. Thus, under designated 
immunoassay conditions, the specified antibodies bind to a particular protein and do 
not bind in a significant amount to other proteins present in the sample. Specific 
binding to an antibody under such conditions may require an antibody that is 
selected for its specificity for a particular protein. For example, antibodies raised to 
adenine methyltransferase with the amino acid sequence depicted inSEQ ID NO: 2, 
4, 6, or 8 can be selected to obtain antibodies specifically immunoreactive with that 
adenine methyltransferase and not with other proteins. A variety of immunoassay 
formats may be used to select antibodies specifically immunoreactive with a 
particular protein. For example, solid-phase ELISA immunoassays are routinely used 
to select monoclonal antibodies specifically immunoreactive with a protein. See 
Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold Spring Harbor 
Publications, New York, for a description of immunoassay formats and conditions 
that can be used to determine specific immunoreactivity. 

B. General Background 

This invention relates to isolated nucleic acid sequences encoding DNA 
adenine methyltransferases. DNA methyltransferases are present in gram-negative 
bacteria such as the free living bacteria Caulobacter, the agriculturally important 
nitrogen-fixing bacterium Rhizobium and the highly infectious animal pathogen 
Brucella. The precise sequences and properties of these methyltransferase genes and 
enzymes are unknown. Prior to the work summarized herein, it was not clear 
whether the methyltransferases of other organisms would have homologous 
sequences and properties. 

The procedure for obtaining methyltransferase genes from selected 
organisms generally involves constructing or obtaining gene libraries from selected 
organisms, detecting and isolating the desired gene, cloning it, and expressing it in a 
suitable bacterial strain or transformed cell line- 

The nucleic acid compositions of this invention, whether RNA, cDNA, 
genomic DNA, or a hybrid of the various combinations, may be isolated from natural 
sources or may be synthesized in vitro. The nucleic acids claimed may be present in 
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transformed or transfected whole cells, in a transformed or transfeaed cell lysate, or 
in a partially purified or substantially pure form. 

Techniques for nucleic acid manipulation of genes encoding the DNA 
adenine methyltransferases such as generating libraries, subcloning into expression 
vectors, labeling probes, DNA hybridization, and the like are described generally in 
Sambrook, et a/.. Molecular Cloning - A Laboratory Manual (2nd Ed.), Vol. 1-3, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989, which is 
incorporated herein by reference. This manual is hereinafter referred to as 
"Sambrook, et a/." 

Nucleic acids and proteins are detected and quantified herein by any of 
a number of means known to those of skill in the art. These include analytical 
biochemical methods such as spectrophotometry, radiography, electrophoresis, 
capillary electrophoresis, high performance liquid chromatography (HPLQ, thin layer 
chromatography (TLO, hyperdiffusion chromatography, and the like, and various 
immunological methods such as fluid or gel precipitin reactions, immunodiffusion 
(single or double), immunoelectrophoresis, radioimmunoassays (RIAs), enzyme-linked 
immunosorbent assays (EUSAs), immunofluorescent assays, and the like. The 
detection of nucleic acids proceeds by well known methods such as Southern 
analysis, northern analysis, gel electrophoresis, PCR, radiolabeling, scintillation 
counting, and affinity chromatography. 

1. Isolation of nucleic acids encoding DNA adenine methyltransferases 

There are various methods of isolating the DNA sequences encoding 
DNA adenine methyltransferases. For example, DNA is isolated from a genomic or 
cDNA library using labelled oligonucleotide probes (e.g., probes having sequences 
complementary to the sequences disclosed herein, such as SEQJD NO: 1, 3, 5, 7, 9- 
11). The libraries are generated from DNA and mRNA from cultures of bacteria that 
are generated from stock cultures. Stock cultures are commercially available from a 
variety of sources including international depositories such as the American Type 
Culture Collection. 

The probes for surveying the libraries can be used directly in 
hybridization assays to isolate DNA encoding DNA adenine methyltransferases. 
Alternatively, probes can be designed for use in amplification techniques such as 
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PCR, and DNA encoding DNA adenine methyltransferases may be isolated by using 
methods such as PCR (see below). 

Methods for making and screening DNA libraries are well established. 
See Gubler, U. and Hoffman, B.J. Gene 25:263-269, 1983 and Sambrook, et a/. To 
prepare a genomic library, the DNA is generally extracted from cells and either 
mechanically sheared or enzymatically digested to yield fragments of about 12-20 kb. 
The fragments are then separated by gradient centrifugation from undesired sizes and 
are subcloned in bacteriophage lambda vectors. These vectors and phage are 
packaged in vitro, as described in Sambrook, et a/. The vector is transformed into a 
recombinant host for propagation, screening and cloning. Recombinant phage are 
analyzed by plaque hybridization as described in Benton and Davis, Science, 
196:180-182 (1977), Colony hybridization is carried out as generally described in 
M. Grunstein et al. Proc. Natl. Acad. Sc/. USA., 72:3961-3965 (1975). 

DNA encoding a DNA adenine methyltransferase is identified in either 
cDNA or genomic libraries by its ability to hybridize with nucleic acid probes, for 
example on Southern blots, and these DNA regions are isolated by standard methods 
familiar to those of skill in the art. See Sambrook, ef al. The nucleic acid sequences 
of the invention are typically identical to or show substantia! sequence identity 
(determined as described below) to the nucleic acid sequence of SEQ ID. No. 1, 3, 
5, or 7. Nucleic acids encoding DNA adenine methyltransferases will typically 
hybridize to the nucleic acid sequence of SEQ ID NO: 1, 3, 5, or 7 under stringent 
conditions. For example, nucleic acids encoding DNA adenine methyltransferases 
will hybridize to the nucleic acid of sequence ID No. 1 under the hybridization and 
wash conditions of 507© formamide at 42 °C. Other stringent hybridization 
conditions may also be selected. Generally, stringent conditions are selected to be 
about 5°C lower than the thermal melting point (Tm) for the specific sequence at a 
defined ionic strength and pH. The Tm is the temperature (under defined ionic 
strength and pH) at which 507© of the target sequence hybridizes to a perfectly 
matched probe. Typically, stringent conditions will be those in which the salt 
concentration is at least about 0.02 molar at pH 7 and the temperature is at least 
about 60°C. As other factors may significantly affect the stringency of hybridization, 
including, among others, base composition and size of the complementary strands. 
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the presence of organic solvents and the extent of base misnnatching, the 
connbination of parameters is more important than the absolute measure of any one. 

Various methods of amplifying target sequences, such as the 
polymerase chain reaction, can also be used to prepare DNA encoding DNA adenine 
methyltransferase. Polymerase chain reaaion (PGR) technology is used to amplify 
such nucleic acid sequences. The isolated sequences encoding DNA adenine 
methyltransferase may also be used as templates for PGR amplification. 

In PGR techniques, oligonucleotide primers complementary to the two 
3' borders of the DNA region to be amplified are synthesized. The polymerase chain 
reaction is then carried out using the two primers. See PGR Protocols: A Guide to 
Methods and Applications. (Innis, M, Gelfand, D,, Sninsky, j. and White, T., eds.). 
Academic Press, San Diego (1990). Primers can be selected to amplify the entire 
regions encoding a fulMength DNA adenine methyltransferase or to amplify smaller 
DNA segments as desired. 

PGR can be used in a variety of protocols to isolate nucleic acids 
encoding the DNA adenine methyltransferases. In these protocols, appropriate 
primers and probes for amplifying DNA encoding DNA adenine methyltransferases 
are generated from analysis of the DNA sequences listed herein. For example, the 
oligonucleotides of SEQ ID Nos. 9-11 can be used in a PGR protocol as described in 
example 1 herein to amplify regions of DNA's encoding methyl transferase proteins. 
Once such regions are PGR-amplified, they can be sequenced and oligonucleotide 
probes can be prepared from sequence obtained. These probes can then be used to 
isolate DNA's encoding DNA adenine methyltransferases, similar to the procedure 
used in examples 1-4 herein. DNA adenine methyltransferases can be isolated from 
a variety of different cellular sources using this procedure. Other oligonucleotide 
probes in addition to those of SEQ ID NO: 1, 3, 5, 7 can also baused in PGR 
protocols to isolate cDNAs encoding the DNA adenine methyltransferases. Such 
probes are subsequences of the full-length coding sequences and can be from 20 
bases to full length and preferably 30-50 bases in length. 

Oligonucleotides for use as probes are chemically synthesized 
according to the solid phase phosphoramidite triester method first described by 
Beaucage, S.L. and Garruthers, M.H., 1981, Tetrahedron Lett., 22{20):1 859-1 862 
using an automated synthesizer, as described in Needham-VanDevanter, D.R., et a/., 
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1984, Nucleic Acids Res., 12:6159-6168. Purification of oligonucleotides is by 
either native acrylamide gel electroprnoresis or by an ion-exchange HPLC as described 
in Pearson, J.D. and Regnier, F.E., 1983, /. Chrom., 255:137-149. The sequence of 
the synthetic oligonucleotide can be verified using the chemical degradation method 
of Maxam, A.M. and Gilbert, W. 1980, in Grossman, L. and Moldave, D., eds. 
Academic Press, New York, Methods in Enzymology, 65:499-560. 

Other methods known to those of skill, in the art may also be used to 
isolate DNA encoding the DNA adenine methyltransferase. See Sambrook, et aL for 
a description of other techniques for the isolation of DNA encoding specific protein 
molecules. 

2. Expression of methyltransferase 

Once DNA encoding DNA adenine methyltransferases is isolated and 
cloned, one can express the DNA adenine methyltransferases in a variety of 
recombinantly engineered cells to ascertain that the isolated gene indeed encodes the 
desired methyltransferase. The expression of natural or synthetic nucleic acids is 
typically achieved by operably linking a nucleic acid of interest to a promoter (which 
is either constitutive or inducible), incorporating the construct into an expression 
vector, and introducing the vector into a suitable host ceil. Typical vectors contain 
transcription and translation terminators, transcription and translation initiation 
sequences, and promoters useful for regulation of the expression of the particular 
nucleic acid. The vectors optionally comprise generic expression cassettes 
containing at least one independent terminator sequence, sequences permitting 
replication of the cassette in eukaryotes, or prokaryotes, or both (e.g., shuttle vectors), 
and selection markers for both prokaryotic and eukaryotic systems. Vectors are 
suitable for replication and integration in prokaryotes, eukaryotes^ or preferably both. 
See, Giliman and Smith (1979), Gene, 8:81-97; Roberts e£ a/. (1987), Nature, 
328:731-734; Berger and Kimmel, Guide to Molecular Clor)ing Techniques, Methods 
in Enzymology, volume 152, Academic Press, Inc., San Diego, CA (Berger); 
Sambrook et aL (1989), Molecular Cloning - A Laboratory Manual (2nd ed.) Vol. 
1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor Press, N.Y., (Sambrook); and 
F.M. Ausubel et aL, Current Protocols in molecular Biology, eds.. Current 
Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley 
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& Sons, Inc., (1994 Supplement) (Ausubel). Product information from manufacturers 
of biological reagents and experimental equipment also provide information useful in 
known biological methods. Such manufacturers include the SIGMA chemical 
company (Saint Louis, MO), R&D systems (Minneapolis, MN), Pharmacia LKB 
Biotechnology (Prscataway, NJ), CLONTECH Laboratories, Inc. (Palo Alto, CA), Chem 
Genes Corp., Aldrich Chemical Company (Milwaukee, Wl), Glen Research, Inc., 
GIBCO BRL Life Technologies, Inc. (Galthersberg, MD), Fluka Chemica-Biochemika 
Analytika (Fluka Chemie AG, Buchs, Switzerland), and Applied Biosystems (Foster 
City, CA), as well as many other commercial sources known to one of skill in the art. 

The nucleic acids (e.g., promoters and vectors) used in the present 
method can be isolated from natural sources, obtained from such sources as ATCC or 
GenBank libraries, or prepared by synthetic methods. Synthetic nucleic acids can be 
prepared by a variety of solution or solid phase methods. Detailed descriptions of 
the procedures for solid phase synthesis of nucleic acids by phosphite-triester, 
phosphotriester, and H-phosphonate chemistries are widely available. See, for 
example, Itakura, U.S. Pat. No. 4,401,796; Caruthers, et a/., U.S. Pat. Nos. 4,458,066 
and 4,500,707; Beaucage, et a/., (1981) Tetrahedron Lett., 22:1859-1862; Matteucci, 
(1981) et a/., /. Am. Chem. Soc, 103:3185-3191; Caruthers, et a/.,(1982) Genetic 
Engmeering, 4:1-17; Jones, chapter 2, Atkinson, et a/., chapter 3, and Sproat, et a/., 
chapter 4, in Oligor^ucleotide Syr^thesis: A Practical Approach, Gait (ed.), IRL Press, 
Washington D.C. (1984); Froehler, et a/., (1986) Tetrahedron Lett., 27:469-472; 
Froehler, et a/., (1986) Nucleic Acids Res., 14:5399-5407; Sinha, et a/. (1983) 
Tetrahedron Lett.. 24:5843-5846; and Sinha, et a/., (1984) Nucl. Acids Res., 12:4539- 
4557, which are incorporated herein by reference. 

a. In vitro gene transfer 

it is expected that those of skill in the art are knowledgeable in the 
numerous expression systems available for expression of DNA encoding DNA 
adenine methyltransferases. No attempt to describe in detail the various methods 
known for the expression of proteins in prokaryotes or eukaryotes is made here. 

There are several well established methods of introducing nucleic acids 
into bacterial and animal cells, any of which may be used in the present invention. 
These include: calcium phosphate precipitation, fusion of the recipient cells with 
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bacterial protoplasts containing the DNA, treatment of the recipient cells with 
liposomes containing the DNA, DEAE dextran, receptor-mediated endocytosis, 
electroporation, micro-injection of the DNA directly into the cells, infection with 
viral vectors, etc. 

For in vitro applications, the delivery of nucleic acids can be to any 
cell grown in culture, whether of bacteria!, plant or animal origin, vertebrate or 
invertebrate, and of any tissue or type. Contact between the cells and the genetically 
engineered nucleic acid constructs, when carried out in vitro, takes place in a 
biologically compatible medium. The concentration of nucleic acid varies widely 
depending on the particular application, but is generally between about 1 //mol and 
about 10 mmol. Treatment of the cells with the nucleic acid is generally carried out 
at physiological temperatures (about 37° C) for about 1 to about 48 hours, preferably 
about 2 to 4 hours. 

In one group of embodiments, a nucleic acid is added to 60-80% 
confluent plated ceils having a cell density of about 10^ to about 10^ cells/mL, more 
preferably about 2x10^ cells/mL. The concentration of the suspension added to 
the cells is preferably from about 0.01 to 0.2 //g/mL, more preferably about 0.1 
A/g/mL. 

b. Cells to be transformed 

The compositions and methods of the present invention are used to 
transfer genes into a wide variety of cell types, in vivo and in vitro. Although any 
prokaryotic or eukaryotic cells may be used, prokaryotic cells such as E. coli are 
preferred. 

c. Detection of methyltransferase-encoding nucleic acids 

The present invention provides methods for detecting DNA or RNA 
encoding DNA adenine methyltransferases. A variety of methods for specific DNA 
and RNA measurement using nucleic acid hybridization techniques are known to 
those of skill in the art. See Sambrook, et a/.; NUCLEIC AciD Hybridization, A 
Practical approach, Ed. Hames, B.D. and Higgins, S.J., IRL Press, 1985; Gall and 
Pardue (1969), Proc. Natl, Acad. 5c/., U.S.A., 63:378-383; and John et a/. (1969) 
Nature, 223:582-587. The selection of a hybridization format is not critical. 
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For example, one method for evaluating the presence or absence of 
DNA encoding DNA adenine methyltransferases in a sample involves a Southern 
transfer. Briefly, the digested genomic DNA is run on agarose slab gels in buffer and 
transferred to membranes. Hybridization is carried out using the nucleic acid probes 
discussed above. As described above, nucleic acid probes are designed based on the 
nucleic acid sequences encoding methyltransferases (See SEQ ID NOs: 1, 3, 5, 7.) 
The probes can be full length or less than the full length of the nucleic acid 
sequence encoding the methyltransferase. Shorter probes are empirically tested for 
specificity. Preferably nucleic acid probes are 20 bases or longer in length. (See 
Sambrook, et a/, for methods of selecting nucleic acid probe sequences for use in 
nucleic acid hybridization.) Visualization of the hybridized portions allows the 
qualitative determination of the presence or absence of DNA encoding DNA adenine 

methyltransferases. 

Similarly, a Northern transfer may be used for the detection of mRNA 
encoding DNA adenine methyltransferases. In brief, the mRNA is isolated from a 
given cell sample using an acid guanidinium-phenol-chloroform extraction method. 
The mRNA is then electrophoresed to separate the mRNA species and the mRNA is 
transferred from the gel to a nitrocellulose membrane. As with the Southern blots, 
labeled probes are used to identify the presence or absence of DNA adenine 

methyltransferases. 

Sandwich assays are commercially useful hybridization assays for 
detecting or isolating nucleic acid sequences. Such assays utilize a "capture" nucleic 
acid covalently immobilized to a solid support and a labelled "signal" nucleic acid in 
solution. The clinical sample will provide the target nucleic acid. The "capture" 
nucleic acid and "signal" nucleic acid probe hybridize with the target nucleic acid to 
fonn a "sandwich" hybridization complex. To be effective, the signal nucleic acid 
cannot hybridize with the capture nucleic acid. 

Typically, labelled signal nucleic acids are used to detect hybridization. 
Complementary nucleic acids or signal nucleic acids may be labelled by any one of 
several methods typically used to detect the presence of hybridized polynucleotides. 
The most common method of detection is the use of autoradiography with ^H, '"l, 
"S, '*C, or "P-labelled probes or the like. Other labels include ligands which bind 
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to labelled antibodies, fluorophores, chemiluminescent agents, enzymes, and 
antibodies which can serve as specific binding pair members for a labelled ligand. 

Detection of a hybridization complex may require the binding of a 
signal generating complex to a duplex of target and probe polynucleotides or nucleic 
acids. Typically, such binding occurs through ligand and anti-ligand interactions as 
between a ligand-conjugated probe and an anti-ligand conjugated with a signal. 

The label may also allow indirect detection of the hybridization 
complex. For example, where the label is a hapten or antigen, the sample can be 
detected by using antibodies. In these systems, a signal is generated by attaching 
fluorescent or enzyme molecules to the antibodies or, in some cases, by attachment 
to a radioactive label. {Tijssen, P., "Practice and Theory of Enzyme Immunoassays," 
Laboratory Techniques in Biochemistry and Molecular Biology, Burdon, R.H., van 
Knippenberg, P.H., Eds., Elsevier (1985), pp. 9-20.) 

The sensitivity of the hybridization assays may be enhanced through 
use of a nucleic acid amplification system which multiplies the target nucleic acid 
being detected. In vitro amplification techniques suitable for amplifying sequences 
for use as molecular probes or for generating nucleic acid fragments for subsequent 
subcloning are known. Examples of techniques sufficient to direct persons of skill 
through such in vitro amplification methods, including the polymerase chain reaction 
(PCR), the ligase chain reaction (LCR), Q>S-replicase amplification and other RNA 
polymerase mediated techniques (e.g., NASBA), are found in Berger, Sambrook, and 
Ausubel, as well as Mullis et a/. (1987), U.S. Patent No. 4,683,202; PCR Protocols A 
Guide to Methods and Applications (Innis et a/, eds) Academic Press Inc. San Diego, 
CA (1990) (Innis); Arnheim & Levinson (October 1, 1990), C&EN 36-47; The journal 
Of NIH Research (1991), 3: 81-94; (Kwoh et a/. (1989), Proc. Nat/. Acad. Sci. USA, 
86:1173; Guatelli et aL (1990), Prpc. Nat/. Acad. Sci. USA, 87:1.874; Lomell et a/. 
(1989), y. Clin. Chem., 35:1826; Landegren et a/. (1986), Science, 241:1077-1080; 
Van Brunt (1990), Biotechnology, 8:291-294; Wu and Wallace (1989), Gene, 4:560; 
Barringer et a/. (1990), Gene, 89:117, and Sooknanan and Maiek (1995), 
Biotechnology, 13:563-564. Improved methods of cloning in vitro amplified nucleic 
acids are described in Wallace et a/., U.S. Pat. No. 5,426,039. Other methods 
recently described in the art are the nucleic acid sequence based amplification 
(NASBA'", Cangene, Mississauga, Ontario) and Q Beta Replicase systems. These 
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systems can be used to directly identify mutants where the PGR or LCR primers are 
designed to be extended or ligated only when a select sequence is present. 
Altematively, the select sequences can be generally amplified using, for example, 
nonspecific PGR primers and the amplified target region later probed for a specific 
sequence indicative of a mutation. 

Oligonucleotides for use as probes, e.g., in in vitro amplification 
methods, for use as gene probes, or as inhibitor components are typically synthesized 
chemically according to the solid phase phosphoramidite triester method described 
by Beaucage and Caruthers (1981), Tetrahedron Letts., 22(20):1 859-1 862, e.g., using 
an automated synthesizer, as described in Needham-Van Devanter et a/. (1984), 
Nucleic Acids Res., 12:6159-6168. Purification of oligonucleotides, where 
necessary, is typically performed by either native acrylamide gel electrophoresis or 
by anion-exchange HPLG as described in Pearson and Regnier (1983), /. Chrom,, 
255:137-149. The sequence of the synthetic oligonucleotides can be verified using 
the chemical degradation method of Maxam and Gilbert (1980) in Grossman and 
Moldave (edsJ Academic Press, New York, Methods in Enzymology, 65:499-560. 

An alternative means for determining the level of expression of a gene 
encoding an DNA adenine methyltransferase is in situ hybridization. In situ 
hybridization assays are well known and are generally described in Angerer, et aL, 
Methods EnzymoL, 152:649-660 (1987). In an in situ hybridization assay, cells are 
fixed to a solid support, typically a glass slide. If DNA is to be probed, the cells are 
denatured with heat or alkali. The cells are then contacted with a hybridization 
solution at a moderate temperature to permit annealing of labeled probes specific to 
DNA adenine methy transferases. The probes are preferably labeled with 
radioisotopes or fluorescent reporters. 

d. Detection of methyltransferase gene products 

Methyltransferase may be detected or quantified by a variety of 
methods. Preferred methods involve the use of specific antibodies. 

Methods of producing polyclonal and monoclonal antibodies are 
known to those of skill in the art. See, e.g., Goligan (1991), Gurrent Protocols in 
Immunology, Wiley/Greene, NY; and Hariow and Lane (1989), Antibodies: A 
Laboratory Manual, Gold Spring Harbor Press, NY; Stites et a/, (eds.) Basic and 
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Clinical Immunology (4th ed.) Lange Medical Publications, Los Altos, CA, and 
references cited therein; Coding (1986), Monoclonal Antibodies: Principles and 
Practice (2d ed.) Academic Press, New York, NY; and Kohler and Milstein (1975), 
Nature, 256:495-497. Such techniques include antibody preparation by selection of 
antibodies from libraries of recombinant antibodies in phage or similar vectors. See, 
Huse et a/. (1989), Science, 246:1275-1281; and Ward et a/. (1989), Nature, 
341:544-546. For example, in order to produce antisera for use in an immunoassay, 
the polypeptide of SEQ ID NO: 2, 4, 6, or 8, or a fragment thereof, is isolated as 
described herein. For example, recombinant protein is produced in a transformed 
cell line. An inbred strain of mice or rabbits is immunized with the protein of SEQ 
ID No. 2, 4, 6, or 8, or a fragment thereof, using a standard adjuvant, such as 
Freund's adjuvant, and a standard immunization protocol. Alternatively, a synthetic 
peptide derived from the sequences disclosed herein and conjugated to a carrier 
protein can be used as an immunogen. Polyclonal sera are collected and titered 
against the immunogen protein in an immunoassay, for example, a solid phase 
immunoassay with the immunogen immobilized on a solid support. Polyclonal 
antisera with a titer of 10* or greater are selected and tested for their cross reactivity 
against non-adenine methyltransferases or even other adenine methyltransferases, 
using a competitive binding immunoassay. Specific monoclonal and polyclonal 
antibodies and antisera will usually bind with a Kq of at least about .1 mM, more 
usually at least about 1 /vM, preferably at least about .1 /vM or better, and most 
preferably, .01 a/M or better. 

A number of immunogens may be used to produce antibodies 
specifically reactive with DNA adenine methyltransferases- Recombinant protein is 
the preferred immunogen for the production of monoclonal or polyclonal antibodies. 
Naturally occurring protein may also be used either in pure or impure form. 
Synthetic peptides made using the DNA adenine methyltransferase sequences 
described herein may also used as an immunogen for the production of antibodies to 
the protein. Recombinant protein can be expressed in eukaryotic or prokaryotic cells 
as described above, and purified as generally described above. The product is then 
injected into an animal capable of producing antibodies. Either monoclonal or 
polyclonal antibodies may be generated, for subsequent use in immunoassays to 
measure the protein. 
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Methods of production of polyclonal antibodies are known to those of 
skill in the art. In brief, an immunogen, preferably a purified protein, is mixed with 
an adjuvant and animals are immunized. The animal's immune response to the 
immunogen preparation is monitored by taking test bleeds and determining the titer 
of reactivity to the DNA adenine methyltransferase. When appropriately high titers 
of antibody to the immunogen are obtained, blood is collected from the animal and 
antisera are prepared. Further fractionation of the antisera to enrich for antibodies 
reactive to the protein can be done if desired. (See Harlow and Lane, supra). 

Monoclonal antibodies may be obtained by various techniques familiar 
to those skilled in the art. Briefly, spleen cells from an animal immunized with a 
desired antigen are immortalized, commonly by fusion with a myeloma cell (See, 
Kohler and Milstein, £ur. /. Immunol. 6:511-519 (1976), incorporated herein by 
reference). Alternative methods of immortalization include transformation with 
Epstein Barr Virus, oncogenes, or retroviruses, or other methods well known in the 
art. Colonies arising from single immortalized cells are screened for production of 
antibodies of the desired specificity and affinity for the antigen, and yield of the 
monoclonal antibodies produced by such cells may be enhanced by various 
techniques, including injection into the peritoneal cavity of a vertebrate host. 
Alternatively, one may isolate DNA sequences which encode a monoclonal antibody 
or a binding fragment thereof by screening a DNA library from human B cells 
according to the general protocol outlined by Huse, et a/. (1989) Science 246:1275- 
1281. 

A particular protein can be measured by a variety of immunoassay 
methods. For a review of immunological and immunoassay procedures in general, 
see Basic and Clinical Immunology 7th Edition (D. Stites and A. Terr ed.) 1991. 
Moreover, the immunoassays of the present invention can be performed in any of 
several configurations, which are reviewed extensively in Enzyme Immunoassay, E.T. 
Maggio, ed., CRC Press, Boca Raton, Florida (1980); "Practice and Theory of Enzyme 
Immunoassays," P. Tijssen, Laboratory Techniques in Biochemistry and Molecular 
B/o/ogy," Elsevier Science Publishers B.V. Amsterdam (1985); and Harlow and Lane, 
Antibodies. A Laboratory Manual, supra, each of which is incorporated herein by 
reference. 
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Immunoassays to methyltransferases of the present invention may use a 
polyclonal antiserum which was raised to the protein of SEQ ID NO: 2, A, 6, or 8, or 
a fragment thereof. This antiserum is selected to have low crossreactivity against 
other (non-methyltransferase or methyitransferase) proteins and any such 
crossreactivity is removed by immunoabsorbtion prior to use in the immunoassay. 

In addition, it is possible to produce monospecific antibodies that reaa 
to specific DNA methyltransferases from specific species of bacteria as identified 
herein. Monospecific antibodies are achieved by appropriate cross-absorption with 
select DNA methyltransferases or by raising antibodies against species specific 
regions of the amino acid sequence of the transferases. Such unique peptide 
fragments are routinely identified by sequence comparisons. 

In order to produce antisera for use in an immunoassay, the protein of 
SEQ ID NO: 2, A, 6, or 8, or a fragment thereof, is isolated as described herein. For 
example, recombinant protein is produced in a transformed cell line. An inbred 
strain of mice such as balb/c is immunized with the protein of SEQ ID NO: 2 using a 
standard adjuvant, such as Freund's adjuvant, and a standard mouse immunization 
protocol. Alternatively, a synthetic peptide derived from the sequences disclosed 
herein and conjugated to a carrier protein can be used as an immunogen. Polyclonal 
sera are collected and titered against the immunogen protein in an immunoassay, for 
example, a solid phase immunoassay with the immunogen immobilized on a solid 
support. Polyclonal antisera with a titer of lO"* or greater are selected and tested for 
their cross reactivity against non-adenine methyltransferases, using a competitive 
binding immunoassay such as the one described in Harlow and Lane, supra, at pages 
570-573. 

Immunoassays in the competitive binding format can be used for the 
crossreactivity determinations. For example, the protein of SEQ^MD NO: 2 can be 
immobilized to a solid support. Proteins (other methyltransferases, or non- 
methyltransferases) are added to the assay which compete with the binding of the 
antisera to the immobilized antigen. The ability of the above proteins to compete 
with the binding of the antisera to the immobilized protein is compared to the 
protein of SEQ ID NO: 2. The percent crossreactivity for the above proteins is 
calculated, using standard calculations. Those antisera with less than 10% 
crossreactivity with each of the proteins listed above are selected and pooled. The 
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cross-reacting antibodies are optionally removed from the pooled antisera by 
immunoabsorbtion with the above-listed proteins. 

The immunoabsorbed and pooled antisera are then used in a 
competitive binding immunoassay as described above to compare a second protein 
to the immunogen protein, in this case, the adenine methyltransferase of SEQ ID NO: 
2. In order to make this comparison, the two proteins are each assayed at a wide 
range of concentrations and the amount of each protein required to inhibit 50% of 
the binding of the antisera to the immobilized protein is determined. If the amount 
of the second protein required is less than 10 times the amount of the protein of SEQ 
ID NO: 2 that is required, then the second protein is said to specifically bind to an 
antibody generated to an immunogen consisting of the protein of SEQ ID NO: 2. 

The presence of a desired polypeptide (including peptide, transcript, or 
enzymatic digestion product) in a sample may be detected and quantified using 
Western blot analysis. The technique generally comprises separating sample 
products by gel electrophoresis on the basis of molecular weight, transferring the 
separated proteins to a suitable solid support (such as a nitrocellulose filter, a nylon 
filter, or derivatized nylon filter), and Incubating the sample with labeling antibodies 
that specifically bind to the analyte protein. The labeling antibodies specifically bind 
to analyte on the solid support. These antibodies are directly labeled, or alternatively 
are subsequently detected using labeling agents such as antibodies (e.g., labeled 
sheep anti-mouse antibodies where the antibody to an analyte is a murine antibody) 
that specifically bind to the labeling antibody. 

3. Purification of DNA adenine methyltransferases 

The polypeptides of this invention may be purified to substantial purity 
by standard techniques, including selective precipitation with suQh substances as 
ammonium sulfate, column chromatography, immunopurification methods, and 
others. See, for instance, R. Scopes, Protein Purification: Principles and Practice, 
Springer-Verlag: New York (1982), incorporated herein by reference. For example, 
the methyltransferase proteins and polypeptides produced by recombinant DNA 
technology may be purified by a combination of cell lysis (e.g., sonication) and 
affinity chromatography or immunoprecipitation with a specific antibody to 
methyltransferase. For fusion products, subsequent digestion of the fusion protein 
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with an appropriate proteolytic enzyme releases the desired polypeptide. The 
proteins may then be further purified by standard protein chemistry techniques. A 
specific protocol for purifying the methyltransferases of this invention is provided in 
Example 1(e). 

4. Screening for inhibitors of methyltransferase or associated gene 
expression 

The methyltransferase genes identified herein provide novel targets for 
screening for agents that attenuate, inhibit, or interfere with the viability of the 
pathogens bearing with the gene. Inhibition (/.e. blocking) or complete elimination 
of the expression of the methyltransferase gene or genes described herein results in a 
mitigation or elimination of the ability of the subject bacteria to infect and/or grow 
and/or proliferate in an animal or plant host as compared to the same stain of 
bacteria (or virus) in which there is no inhibition or elimination of the virulence- 
related gene or gene product. 

Having provided herein genes whose expression is required for 
viability of pathogenic bacteria, it is possible to screen for agents and/or for drugs 
that, by blocking the activity of the methyltransferase gene, mitigate the virulence of 
the target pathogen. 

Antibiotics and other synthetic drugs targeted to specific proteins 
generally act by interacting with and inhibiting the activity of the target protein. The 
methyltransferase enzymatic activity assays provided herein are useful to identify 
inhibitors of that activity. To do so, the enzymes capacity to methylate a nucleic 
acid is assayed in the presence and absence of a test substance, such as a synthetic 
or isolated naturally occurring chemical inhibitor (in particular peptides or other 
ligands that bind to the active site or to allosteric sites of the methyltransferase 
enzyme). An inhibitor of the transferase depresses the activity of enzyme at least 
50%, preferably at least 90%, and most preferably at least 99%. 

The methyltransferase genes or gene product (/.e., mRNA) is preferably 
detected and/or quantified in a biological sample. As used herein, a biological 
sample is a sample of biological tissue or fluid that, in a healthy and/or pathological 
state, contains methyltransferase encoding nucleic acid or the polypeptide. Such 
samples include, but are not limited to, sputum, amniotic fluid, blood, blood cells 
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(e.g., white cells), tissue or fine needle biopsy samples, urine, peritoneal fluid, and 
pleural fluid, or cells therefrom. For plants, root tissue or leaf tissue can be used. 
Biological samples may also include seaions of tissues such as frozen sections taken 

for histological purposes. 

The present invention encompasses developing antisense protocols, 
antibiotics and antagonists that specifically inhibit the methyltransferase activity of the 
identified enzymes or the expression of the genes of this invention. The detection 
and testing of such inhibitors is made possible by the ability to make and obtain the 
claimed enzyme using methods described herein. 

Antisense agents are used to reduce or eliminate methyltransferase 
activity. Antisense agents include fragments or the methyltransferase genes that are 
operably linked in reverse orientation to an efficient promoter. Also included in 
antisense agents are ribozymes such as the hairpin or hammerhead types. For 
antisense agents suitable assays involve detecting the presence, absence, or quantity 
or amount of transcript of the gene or gene product. Northern blots, quantitative 
PGR or immunoassays are all suitable for detection of the effectiveness of antisense 
agents. 

In still another embodiment, bacterial reporter strains are used to 
evaluate candidate anti-transferase agents. In such assays, recombinant bacteria are 
modified to include a reporter gene attached to a nucleic acid encoding the 
methyltransferase gene. When the genes are expressed, the reporter gene is also 
expressed and provides a detectable signal indicating the expression of the gene. 
Anti-methyltransferase agent screens then involve contacting the reporter strains 
and/or cells, tissues, or organisms prior to or after infection with the reporter strains 
and subsequently detecting expression levels of the reporter gene. 

In addition to screening for antisense agents, this invention provides for 
methods that facilitate the identification of non-antisense drug candidates especially 
under conditions of high throughput. The screening for such non-nucleic acid based 
inhibitory agents commonly involves contacting the target pathogen (e.g. Brucella 
abortus), and /or a tissue containing the pathogen, and/or an animal, with one or 
more candidate anti-methyltransferase agents and detecting the presence absence, 
quantity of the gene product. Alternatively, candidate anti-methyltransferase agents 
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can be identified sinnply by their ability to bind to the gene or gene product and 
inhibit its biological activity. 

Methods for detecting the biological activity of the methyltransferases 
are provided herein and include reaction conditions and suitable substrates for 
methylation. These assays can be used to screen for anti-methyltransferase agents. 
Absence of the activity of the gene during and/or after contacting of the bacteria, a 
cell, a tissue, and/or an organism with an anti-transferase agent of interest will 
indicate that the particular test compound is a likely candidate for an antibiotic. 

In view of the foregoing, preferred assays for detection anti- 
methyltransferase agents fall into the following categories: 

i) Detection of gene or gene-derived nucleic acid presence, absence, or 

quantity; 

ii) Screening for agents that bind to a gene or gene derived nucleic 

acid; 

iii) Detection of a virulence gene derived polypeptide; 

iv) Detection of binding of a prospective agent to gene derived 

polypeptides; 

v) Use of bacterial reporter strains; and, 

vi) Detection of the biological activity of the transferase gene. 

5. High-Throughput Screening of Candidate Agents that Block 
Methyltransferase Activity. 

Conventionally, new chemical entities with useful properties are 
generated by identifying a chemical compound (called a "lead compound") with 
some desirable property or activity, creating variants of the lead compound, and 
evaluating the property and activity of those variant compounds.. However, the 
current trend is to shorten the time scale for all aspects of drug discovery. Because 
of the ability to test large numbers quickly and efficiently, high throughput screening 
(HTS) methods are replacing conventional lead compound identification methods. 

In one preferred embodiment, high throughput screening methods 
involve providing a library containing a large number of potential therapeutic 
compounds (candidate compounds). Such "combinatorial chemical libraries" are 
then screened in one or more assays, as described herein, to identify those library 
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members (particular chemical species or subclasses) that display a desired 
characteristic activity. The compounds thus identified can serve as conventional 
"lead compounds" or can themselves be used as potential or actual therapeutics. 

a. Combinatorial chemical libraries 

Recently, attention has focused on the use of combinatorial chemical 
libraries to assist in the generation of new chemical compound leads. A 
combinatorial chemical library is a collection of diverse chemical compounds 
generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear 
combinatorial chemical library such as a polypeptide library is formed by combining 
a set of chemical building blocks called amino acids in every possible way for a 
given compound length {i.e., the number of amino acids in a polypeptide 
compound). Millions of chemical compounds can be synthesized through such 
combinatorial mixing of chemical building blocks. For example, one commentator 
has observed that the systematic, combinatorial mixing of 100 interchangeable 
chemical building blocks results in the theoretical synthesis of 100 million tetrameric 
compounds or 10 billion pentameric compounds (Gallop et a/. (1994) 37(9): 
1233-1250). 

Preparation and screening of combinatorial chemical libraries is well 
known to those of skill in the art. Such combinatorial chemical libraries include, but 
are not limited to, peptide libraries (see, e.g., U.S. Patent 5,010,175, Furka (1991) 
Int. }. Pept. Prot. Res., 37: 487-493, Houghton et a/. (1991) Nature, 354: 84-88). 
Peptide synthesis is by no means the only approach envisioned and intended for use 
with the present invention. Other chemistries for generating chemical diversity 
libraries can also be used. Such chemistries include, but are noLlimited to: peptolds 
(PCT Publication No WO 91/19735, 26 Dec. 1991), encoded peptides (PCT 
Publication WO 93/20242, 14 Oct. 1993), random bio-oligomers (PCT Publication 
WO 92/00091, 9 jan. 1992), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers 
such as hydantoins, benzodiazepines and dipeptides (Hobbs et a/., (1993) Proc. Nat. 
Acad. Sci. USA 90: 6909-6913), vinylogous polypeptides (Hagihara et a/. (1992) J. 
Amer. Chem. 5oc. 114: 6568). nonpeptidal peptidomimetics with a Beta- D- Glucose 
scaffolding (Hirschmann et a/., (1992) /. Amer. Chem. Soc. 114: 9217-9218), 
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analogous organic syntheses of small compound libraries (Chen et aL (1994) /. Amer. 
Chem. Soc. 1 16: 2661), oligocarbamates (Cho, et ai., (1993) Science 261:1303), 
and/or peptidyl phosphonates (Campbell et a/., (1994) ). Org. Chem, 59: 658), See, 
generally, Gordon et aL, (1994) /. Med. Chem. 37:1385, nucleic acid libraries, 
peptide nucleic acid libraries (see, e.g., U.S. Patent 5,539,083) antibody libraries 
(see, e.g., Vaughn et a/. (1996) Nature Biotechnology, 14(3): 309-314), and 
PCT/US96/10287), carbohydrate libraries (see, e.g., Liang et a/. (1996) Science, 274: 
1520-1522, and U.S. Patent 5,593,853), and small organic molecule libraries (see, 
e.g.^ benzodiazepines, Baum (1993) C&EN, Jan 18, page 33, isoprenoids U.S. Patent 
5,569,588, thiazoiidinones and metathiazanones U.S. Patent 5,549,974, pyrrolidines 
U.S. Patents 5,525,735 and 5,519,134, morpholino compounds U.S. Patent 
5,506,337, benzodiazepines 5,288,514, and the like). 

Devices for the preparation of combinatorial libraries are commercially 
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, 
Symphony, Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 
Plus, Millipore, Bedford, MA). 

A number of well known robotic systems have also been developed for 
solution phase chemistries. These systems include automated workstations like the 
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. 
(Osaka, Japan) and many robotic systems utilizing robotic arms (Zymate I!, Zymark 
Corporation, Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.) which 
mimic the manual synthetic operations performed by a chemist. Any of the above 
devices are suitable for use with the present invention. The nature and 
implementation of modifications to these devices (if any) so that they can operate as 
discussed herein wilt be apparent to persons skilled in the relevant art. In addition, 
numerous combinatorial libraries are themselves commercially available (see, e.g., 
ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, MO, 
ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
Columbia, MD, etc.). 
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b, Hieh throughput assays of ch ^^mical libraries 
Any of the assays for compounds inhibiting the virulence described 
herein are amenable to high throughput screening. As described above, having 
identified the nucleic acid associated with virulence, likely drug candidates either 
inhibit expression of the gene product, or inhibit the activity of the expressed 
protein. Preferred assays thus detect inhibition of transcription (/.e., inhibition of 
mRNA production) by the test compoundCs), inhibition of protein expression by the 
test compound(s), or binding to the gene (e.g., gDNA, or cDNA) or gene product 
ie.g.y mRNA or expressed protein) by the test compound(s). Alternatively, the assay 
can detect inhibition of the characteristic activity of the gene product or inhibition of 
or binding to a receptor or other transduction molecule that interacts with the gene 
product. 

High throughput assays for the presence, absence, or quantification of 
particular nucleic acids or protein products are well known to those of skill in the 
art. Similarly, binding assays are similarly well known. Thus, for example, U.S. 
Patent 5,559,410 discloses high throughput screening methods for proteins, U.S. 
Patent 5,585,639 discloses high throughput screening methods for nucleic acid 
binding (/.e., in arrays), while U.S. Patents 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially 
available (see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, 
OH; Beckman Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, 
etc.). These systems typically automate entire procedures including all sample and 
reagent pipetting, liquid dispensing, timed incubations, and final readings of the 
microplate in detector(s) appropriate for the assay. These configurable systems 
provide high thruput and rapid start up as well as a high degree_of flexibility and 
customization. The manufacturers of such systems provide detailed protocols the 
various high throughput. Thus, for example, Zymark Corp. provides technical 
bulletins describing screening systems for detecting the modulation of gene 
transcription, ligand binding, and the like. 
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6. Methyitransferase activity. 

This protocol exemplifies a method for assaying for methyitransferase 
activity. It is a particularly good method because it allows for the detection of 
processivity but it need not be so used. 

A hemimethylated DNA substrate containing two (2) GANTC methylation 
sites, for example the N^60/66-mer described in Example 5(b) below^ is used to 
address the processivity of CcrM. The GANTC sites are resistant to Hinfl digestion 
but susceptible to Hindll digestion when hemimethylated. However, upon 
enzymatic methylation, the GANTC sites become fully methylated and resistant to 
Hindi! digestion. The methylation sites in the hemimethylated N^60/66-mer substrate 
are asymmetrically spaced so that DNA fragments of differing sizes are obtained 
upon Hindll digestion. Thus, one can address the preference for initial methylation 
by the enzyme during processive DNA methylation. 

The N^60/66-mer was 5'-labeled using T4 polynucleotide kinase and 
()^^P]-ATP according to the manufacturer's protocol (U.S. Biochemical). Unreacted 
[K^^P]-ATP and T4 polynucleotide kinase were separated from labeled duplex DNA 
by eluting the DNA through a 1-mL G-25 gel filtration column. Methylation assays 
were performed using 250 nM CcrM, 2 ^rM 5'-Iabeled N^60/66-mer, 6 a/M [^HJ-SAM 
in the appropriate reaction buffer at 30°C. 5 //L of reaction was quenched with 500 
fjL 10% perchloric acid, 200 /iL saturated sodium pyrophosphate, and 20 single- 
stranded DNA at times varying from 15 seconds to 20 minutes. These reactions 
were placed on ice for at least 30 minutes, and then were subjected to the filter 
binding assay monitoring [^HJ-CHj incorporation from [^H]-SAM into duplex DNA as 
described in Example 5. 

Concomitantly, 20 fjL reaction aliquots were quenched by either heat 
denaturation of CcrM or by the addition of 50 a/L phenoi/chloro/prm at times varying 
from 15 seconds to 20 minutes. The quenched reactions were then subjected to 
Hindll digestion. Typically, these reactions consisted of 10 /vL of the quenched DNA 
in a 20 //L reaction with the appropriate reaction buffer and 1 /iL of Hindll. After 
three hours of Hindll digestion at 37°C, 10 //L of this reaction was quenched with 
10 /yL of gel loading dye. DNA fragments were then resolved by 16% denaturing gel 
electrophoresis followed by Phosphorlmaging to identify cleavage patterns. 
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Results from the [^H]-SAM assay indicate that two mole equivalents of 
[^Hl-CHj were Incorporated into the N^60/66-mer after 20 minutes. By direct 
contrast, only one mole equivalent of l^HJ-CHj is incorporated into the N^23/30-mer 
or NM5/50-mer after 20 minutes under identical conditions. Results from the Hindll 
digestion assay reveal fully protected DNA substrate (N^60/66-mer) after 20 minutes, 
indicating that DNA had been methylated at both GANTC sites. Furthermore, no 
intermediate products were obtained, i.e., methylation at a single GANTC site. 
Indicating that under the assay conditions used the enzyme processively methylated 
both GANTC sites on the same DNA substrate. Approximately 250 nM of 
processively methylated DNA was detected after Phosphorlmaging quantitation, 
consistent with results from the tritium incorporation assay. 

EXAMPLES 

The examples provided herein are provided by way of illustration only and 
not by way of limitation. Those of skill will readily recognize a variety of noncritical 
parameters which could be changed or modified to yield essentially similar results. 

Example 1. SEQ ID NO:l: Rhizobium methyltransf erase sequence 
a. Isolation 

The Rhizobium meliloti ccrM gene (Rhizobium ccrM) was isolated by 
generating specific probes to Rhizobium ccrM using the Polymerase Chain Reaction 
(PGR) and using them to screen a R. me/»7ot/ lambda library. The primers used to 
generate the probe had the following sequence: 

Fonvard primer (IFADDPPY): 5'-ATY TTY GCB CAY CCB CCB TA (SEQ ID NO:9) 

Fonvard primer 1 (LDPFFG): S'-CCR AAR AAV GGR TCS AG {SEQ ID NO:10) 

Fon«^ard primer 2 (IGIERE): 5'-TCV CGY TCR ATV CCR AT (SEQ ID NO:ll> 

Forward primer and reverse primer 1 amplify a 570 bp fragment. Forward primer 

and reverse primer 2 amplify a 635 bp fragment. The R. meliloti lambda library was 

obtained and subsequent screening was accomplished as described in Sambrook et 

a/. 

Three positive clones were isolated from the library. The complete 
Rhizobium ccrM gene was isolated as a 3.0 kb NotI fragment and has been 
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completely sequenced in both directions (SEQ ID NO:1). The gene encodes a 
protein having SEQ ID NO:2. 

b. Homology between the Caulobacter and Rhizobium ccrM 
methyltransferase genes 

The deduced sequences of the Rhizobium and Caulobacter ccrM genes 
were compared, revealing 61% identity and 74% similarity. Figure 9. The 
homology is present throughout the two sequences, particularly around regions 
which had been previously identified as important to the function of other known 
adenine DNA methyltransferases. However, there are regions of divergence, 
especially around the N- and C- termini. 

The DNA methyltransferase M. Hinfl from Haemophilus influenzae has 
the same recognition sequence (GANTQ as CcrM and is part of a restriction 
modification system in this bacteria (Chandrasegaran et aL, Cene 70:387-392, 1988). 
It should be noted that H. influenzae is not part of the alpha subdivision of gram 
negative bacteria and therefore it is likely that this DNA methyltransferase evolved 
separately from the ccrM family. The deduced sequences derived from the 
Rhizobium and Caulobacter ccrM genes were compared to the M. Hinfl sequence 
and it was found, as predicted, that the Caulobacter and Rhizobium genes are much 
more closely related to each other than to the M. Hinfl DNA methyl-transferase. 
% similarity between the Rhizobium (Rh), Caulobacter (Cc) 
Brucella, Hp - Helicobacter pylori and M. Hinfl (Hf) CcrM proteins 

Cc Rh Br Hf Hp 
Cc 100 74 82 66 57 
Rh 90 64 53 

Br 66. 54 

Hf 71 

c. Rhizobium ccrM is essential in Rhizobium 

Previous work by Stephens et aL, Proc. Natl. Acad. 5c/. 93:1210-1214, 
(1996) has demonstrated that the Caulobacter ccrM is essential for viability in 
Caulobacter. Therefore it is of interest to determine whether other ccrM homologs 
are also essential. 
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The coding sequence of the Rhizobium ccrM was disrupted by 
insertion of the gene encoding kanamycin/neomycin resistance (a selectable marker) 
into the middle of the gene. This construct was cloned into a suicide plasmid that 
under selection integrates into the Rhizobium ccrM locus. The result of this 
integration is that the wild-type copy is separated from the disrupted copy by the 
vector sequence, which includes the sacB gene. Growth of Rhizobium containing an 
active sacB gene on sucrose is lethal (Hynes et a/.. Gene 78:1 1 1-120, 1989), This 
eocdales selection for the second recombination event between the disrupted and 
wild-type copy of ccrM by growth on sucrose. Selection for the event in which only 
the disrupted copy remained at the ccrM locus occurred only in the presence of a 
functional copy of ccrM on a replicating plasmid. Thus the Rhizobium ccrM gene is 
essential for viability in Rhizobium. 



Strain Plasmid ccrM::nptll ccrW+ 

LS2590 none 0 300 

LS2591 none 0 300 

LS2590 pMB440 0 300 

LS2591 pMB440 0 300 

LS2590 pRW175 (ccrM+) 145 105 

LS2591 pRWI 75 (ccrM + ) 192 58 



The Rhizobium ccrM locus can only be disrupted if ccrM is present in trans. 

d. Overexpression of the Rhizobium ccrM gene results in defects 
in cell division and cell morphology 

Caulobacter goes to great lengths to ensure that CcrM is presently only 
at a specific time of the cell cycle, by regulating the availability of CcrM at two 
levels: transcription and protein turnover (Stephens et a/., /. Bacterioi. 177:1662- 
1669, 1995; Wright et a/., Genes and Development 10:1532-1542, 1996). If this 
regulation is perturbed by expressing ccrM throughout the cell cycle, the cells exhibit 
defects in cell division, cell morphology, and the initiation of DNA replication 
(Zweiger et a/., /. Mo/. BioL 235: 472-485, 1994; Wright et a/.. Genes and 
Development 10:1532-1542, 1996). Thus it is important to ensure that CcrM is only 
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present in predi visional stage of the Caulobacter cell cycle. We were therefore 
interested to determine what would happen if the Rhizobium ccrM gene were 
expressed at high levels in Rhizobium, 

The 3.0 kb NotI fragment encompassing the Rhizobium ccrM gene was 
ligated into a high copy number plasmid and this construct was mated into wild-type 
Rhizobium, The phenotype of the resulting strain is clearly abnormal compared to 
wild-type. Wild type Rhizobium is a short rod-shaped cell; however, the cells of the 
strain in which ccrM was overexpressed are much larger and are highly branched. 
The branching points appear to occur randomly and vary dramatically between cells. 
This phenotype is similar to that observed when the cell division gene ftsZ is 
overexpressed in Rhizobium (B. Margolin, personal communication). 

Interestingly, if the Rhizobium ccrM gene is placed in the high copy 
number plasmid such that it is driven by an additional promoter from the plasmid, no 
transformants were obtained in Rhizobium. This suggests that the cells can tolerate, 
to a certain extent, an elevated level of CcrM, but there is a point at which the level 
of ccrM in the cell becomes lethal. 

As CcrM is only present at a specific time in the Caulobacter cell cycle, 
hemimethylated DNA can be detected in mixed cell cultures. When ccrM is 
expressed throughout the cell cycle, whether in a Ion null mutant or from expression 
from a constitutively transcribed promoter, only fully methylated DNA can be 
detected. It was of interest to determine whether hemimethylated DNA could be 
detected in Rhizobium, which would suggest that the Rhizobium ccrM is also cell 
cycle regulated- A naturally occurring restriction site which overlaps a Hinfl site and 
is sensitive to adenine methylation was identified in Rhizobium. The DNA 
methylation state at that site was determined and hemimethylated DNA was 
detected. For a detailed explanation of this experiment see-Zw^iger et a/., /. Mo/. 
Biol, 235: 472-485, (1994). The detection of hemimethylated DNA could be due to 
either protection from being methylated by a protein binding at that site or the 
Rhizobium CcrM being present only at a specific time in the cell cycle. 

e. Enzyme purification 

BL21(DE3) hosting pCS255b was streaked from glycerol stock onto an 
SB (30 g tryptone, 20 g yeast extract, 10 g MOPS, pH 7.5) agar plate containing 200 
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Afg/ml amp, and maintained at 37°C. Each 1 L SB/amp (200 fjg/mL) culture was 
inoculated with one single colony at 37°C until ODj,oo-0.8. Each cell culture was 
then induced with 0.5 mM IPTC at 37°C for 1.5-2 hours. 

The cells were harvested by centrifugation at 12000 rpm at 4°C for 20 
minutes. Approximately 20 grams of cell paste was obtained from 5 liters of culture. 
The cells were resuspended in a 25 mM HEPES, pH 7.5, 1 mM EDTA, 5 mM p- 
mercaptoethanol, 1 mg/mL lysozyme, and 0.1% PMSF 10% glycerol, and lysed by 
sonication using a 50% duty cycle. The process involved sonicating for 30 seconds, 
stirring the cells for 90 seconds, and repeating the process until the solution was very 
viscous. This solution was then centrifuged at 12,000 rpm for 20 minutes at 4*'C, 
followed by uitracentrification at 40,000 rpm at 4°C for 2 hours. 

The supernatant was diluted 5-fold with Buffer A (25 mM HEPES, pH 
7.5, 5 mM P-ME, 1 mM EDTA, 10% glycerol) and applied to a 30 x 2.5 cm DEAE- 
Sephacel connected to a P11 phosphocellulose column pre-equilibrated with 1 L of 
buffer A. CcrM does not bind to DEAE-Sephacel while 90% of the proteins from the 
cell lysate do. The two connected columns were washed with 500 mL buffer A. 
The PH column was then disconnected from the DEAE column and eluted with a 
linear gradient of 1 L buffer A with 25 to 750 mM NaCI. CcrM was eluted at -300 
mM NaCI. Fractions were collected and analyzed for protein content by Abs280 as " 
well as by SDS-PAGE. 

After elution of the protein from the phosphocellulose column, the 
enzyme was concentrated using an Amicon apparatus employing a YM-30 molecular 
weight cut-off membrane. After concentration, the protein was determined to be 
>95% pure based upon SDS-polyacrylamide gel electrophoresis. The concentration 
of the protein was first measured using the Bradford colorimetric technique (Bradford, 
Anal. Biochem. 72, 248-254 (1976)). The second method for determining the 
concentration of CcrM utilizes measuring the ultraviolet-visible spectroscopy 
absorbance of the protein at a wavelength of 280 nm. The extinction coefficient of 
the protein was determined from the predicted amino acid composition (Zweiger et 
a/., ;. Mo/. Biol. 245, 472-485 (1994)) using the method of Gill and von Hippel Anal. 
Biochem. 182, 319-326 (1989)). The concentration of CcrM based upon this method 
is in excellent agreement with the concentration based on the Bradford method. 
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f. Rhizobium CcrM is degraded in a Lon protease-dependent 

process as has been shown in Caulobacter (Wright ef a/.. Genes 
and Development 10:1532-1542, 1996). 

Lon is a conserved phylogenetically widespread serine protease 
involved in the degradation of abnormal proteins. We generated a Lon null mutation 
in Caulobacter crescentus and demonstrated that ccrM transcription is still temporally 
regulated, but that it is present throughout the cell cycle, resulting in.a fully 
methylated chromosome throughout the cell cycle, causing developmental defects 
(Wright et a/., Genes and Devehpment 10:1532-1542, 1996). Using similar 
methods as described in Wright et a/., we expect that Rhizobium CcrM is degraded 
in a Lon protease-dependent process as has been shown in Caulobacter. 

Example 2. Brucella abortus methyltransferase sequence 

The Brucella ccrM gene was isolated using the same strategy and 
primers as that described for isolating the Rhizobium ccrM gene, but using a Brucella 
gene library. A specific probe to the Brucella ccrM gene generated by PGR using the 
above mentioned primers was used to screen a Brucella lambda library and three 

clones were isolated. 

Restriction mapping of these clones demonstrated that they all 
contained the full length ccrM gene. A 2.0 kb HindU fragment isolated from one of 
the positive clones which contained the complete Brucella ccrM gene was sequenced 
(Figures 3 and 4). As with the Rhizobium ccrM gene, the deduced sequence of the 
Brucella gene exhibits very high homology to both the Caulobacter and Rhizobium 
ccrM genes and lower homology to the M. Hinfl DNA methyltransferase (Figures 9). 

Example 3. Agrobacterium fume/aciens methyltransferase sequence 

The Agrobacterium tumefaciens ccrM gene was isolated using the same 
strategy as that described for isolating the Rhizobium and Brucella ccrM gene, but 
using an Agrobacterium gene library. A partial gene and protein sequence are 
summarized in Figs. 5 and 6. 
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Example 4. Helicobacter pylori methyl transferase sequence 

Helicobacter pylori is a small, microaerophilic Gram-negative organism 
which can colonize the human stomach. It is a causative agent of chronic gastritis 
and peptic ulcer disease, and H. pylori infection has also been epidemiologically 
correlated with increased risk of gastric carcinoma and lymphoma. 

H. pylori belongs to the epsilon subdivision of proteobacterra, and is 
thus evolutionarily separated from Caulobacter crescentuSr Rhizobium meliloti, and 
Brucella abortus, all of which belong to the alpha subdivision. 

The gene for the H. pylori homolog of CcrM has been cloned and 
sequenced. Unlike the other ccrM homoiogs cloned so far, the H. pylori gene has a 
large open reading frame located immediately downstream. The sequencing of this 
open reading frame is still in progress. There is high homology between the H. 
pylori CcrM homolog and the M.Hinfl methyltransferase from Haemophilus 
influenzae. Because there is extensive precedent for finding close genetic linkage 
between methyltransferases and their cognate restriction endonucleases in Type II 
restriction-modification systems such as Hinfl, it is likely that this open reading frame 
encodes a restriction endonuclease. 

Because of the function of methyltransferases in such restriction- 
modification systems (i.e. protecting native host DNA from digestion by the cognate 
restriction endonuclease), it is also likely that absence of the functional 
methyltransferase will prove lethal to H. pylori. 

The Helicobacter pylori ccrM gene was isolated using the same strategy 
as that described for isolating the above ccrM genes, but using a Helicobacter library. 
The gene and protein sequence are provided in Figs. 7 and 8. 

Example 5. Assay for methyltransferase 

The present invention also comprises efficient assays for determining 
methyltransferase activity. 

a. Materials 

[^HJ-S-Adenosyl methionine ([^H]-SAM), [k-"P1ATP, and [a-"P]-dATP 
were from New England Nuclear. Phosphoramidites for DNA synthesis were 
obtained from Glenn Research with the exception of the N^-methyl-deoxyadenosine 
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- phosphoramidite which was obtained from Pharmacia. Restriction and DNA- 
modifying enzymes used during molecular cloning and DNA manipulation 
experiments were generally from New England Biolabs, Promega, United States 
Biochemical, or Boehringer Mannheim. All other materials were obtained from 
commercial sources and were of the highest available quality. 

The CcrM used in the following assays was obtained by the purification 
protocol described essentially in Example I.e. 

b«. tn vitro assays 

Methyltransferase activity of CcrM was assayed by two distinct 
methods. In the first method, restriction assays were used to test methylation of 
restriction sites. The amount of DNA that is resistant to cleavage by restriction 
enzyme digest due to hemi- or full methylation of either the small DNA substrate or 
the pUCIS plasmid can be accurately monitored. If the DNA is hemi- or fully 
methylated by CcrM, the restricted enzyme is unable to cleave the DNA molecule 
and full length starting material will be obtained. If the DNA is cleaved by the 
restriction enzyme, smaller DNA fragments will be obtained and indicate a lack of 
methyl incorporation into the oligonucleotide. 

The sequences of the DNA substrates were derived from the upstream 
sequence from the dnaA promoter. The sequence of the dnaA promoter has been 
published (Zweiger et ai, }. Moi BioL 235: 472-485, 1994). 
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The following is a list of substrates that were used (this list is not meant to be 
exhaustive): 



17/23 mer DNA substrate: (SEQ ID NO: 12) 

5 ' actCQcaa gtcaac aaa 3 * 
3 ' aaQcqct caattq t cc 1 1 a tcgg 5 ' 

23/30.mer (SEQ ID NO: 13) 

5'- TCC TCT CCC <»G TCA AC A GAA AT 
3'- AGG AGA GCG CTC ACT TGT CTT TAT AGO CGC 

N^23/30.mer (SEQ ID NO: 14) 
CH3 

5*. TCC TCT CGC GAG TCA ACA GAA AT 

AGG ACA GCG CTC ACT TGT CTT TAT AGG CGC 

N«23/N*30.mer (SEQ ID NO: 15) 

5'- TCC TCT CCC 
3'- AGG AGA GCG 

4S/S0.nier (SEQ ID NO: 16) 

5'«ATC CTC TCG CGA CTC AAC AGA AAT ATC CGC TCA TCA CCC CAA GTT 

3'- AG GAG AGC GCT CAG TTG TCT TTA TAG GCG ACT ACT CGC GTT CAA TAG CCA A 



CH3 

GXG TCA ACA CAA AT 

CTC AGT TGT CTT TAT AGG CGC 



N645/50-mer (SEQ ID NO: 17) 

5 ' -ATC CTC TCG CGA CTC AAC AGA AAT 
3'- AG GAG AGC GCT CAG TTG TCT TTA 



ATC CGC TCA TCA CCC CAA GTT 

TAG GCG AGT AGT GGC GTT CAA AAC CCA A 



60/66*iner (SEQ ID NO: IB) 

5 '-ATC CTC TCG CGA CTC AAC AGA AAT ATC CGC CAG TCA CCG CAA GTT TTC CGT TTG ACC CCC 

3'- AC GAG AGC GCT CAG TTC TCT TTA TAG CCC CTC ACT CGC CTT CAA AAC GCA AAC TCG CCG TGG GAG G 



N660/66.mcr (SEQ ID NO: 19) 

5* -ATC CTC TCG CCX CTC AAC AGA 
3'- AG GAG AGC GCT CAG TTG TCT TTA TAG CCC CTC ACT GGC GTT CAA AAC CCA AAC TGG CCG TGG GAG 



CH3 CH3 
5* -ATC CTC TCG CCX CTC AAC AGA AAT ATC CGC dlc TCA CCG CAA GTT TTC CGT TTC ACC GGC 
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All synthetic oligonucleotides were synthesized using a DNA synthesizer and were 
purified as previously described by Capson et a/., Biochemisiry 31, 10984-10994 
(1992)). Small duplex DNA substrates (23/30-mer) were prepared by the protocol of 
Kuchta et a/.. Biochemistry 26, 8410-841 7 (1987)). 

Larger DNA substrates (60/66-mer and N^60/66-nner) were prepared 
using a modification of the protocol established by Kaboord and Benkovic, Proc. 
Natl. Acad. Sci. USA 90. 10881-10885 (1993). Briefly, each single-strand DNA 
component was constructed by first 5' labeling one oligonucleotide. After ensuring 
that the labeling reaction was greater than 95% complete, the labeled 
oligonucleotide was annealed with the second oligonucleotide and a small linker 
oligonucleotide to bridge the gapped region. The two oligonucleotides were then 
ligated in the presence of T4 DNA ligase and MgATP. The linker oligonucleotide 
was separated from the ligated oligonucleotide by denaturing gel elertrophoresis. 
The complementary large strand was constructed in an identical manner. Following 
purification of each respective large oligonucleotide, the two strands were annealed 
and purified by nondenaturing gel electrophoresis described by Capson et a/., 
Biochemistry 31, 10984-10994 (1992). All duplex DNA were quantitated as 
described by Kuchta et a/.. Biochemistry 26, 8410-841 7 (1987). 

Analysis of DNA cleavage depends upon the nature of the DNA 
substrate. Small duplex DNA substrates can be 5' end-labeled using bacteriophage 
T4 polynucleotide kinase and [^-"PIATP as the phosphate source. Both cleaved and 
uncleaved DNA are resolved by 20% denaturing gel electrophoresis followed by 
phosphorimaging techniques to analyze for product formation, i.e., cleavage of the 
larger duplex DNA. Furthermore, accurate quantitation of the reaction products was 
obtained by manipulation of the Phosphorlmager software. 

A typical assay for the methyltransferase activity olCcrM was 
performed incubating 50 nM CcrM with 1 jjM 5'-labeled DNA while maintaining the 
concentration of S-adenosyl methionine (SAM) at 20 /^M. The reaction was 
performed in a buffer consisting of 50 mM Tris-HCl, pH 7.5 and 5 mM p- 
mercaptoethanol U3-ME) with 150 mM potassium acetate at 30°C. 10 fjl aliquots of 
the methylation reaction were quenched at variable times from 30 seconds to 10 
minutes with 10 //L 1 N HCI, extracted with 40 fjl of phenol/chloroform, and 
neutralized with 3 M NaOH in 1 M Tris. The methylated DNA was then subjected 
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to restriction digest by either Hinf\ or H/ndll. Each reaction contained a final 
concentration of 100 nM reacted DNA in the presence of 1 unit//yL of Hin{\ or H/ndll 
in the appropriate reaction buffer supplied by the nnanufacturer at 37°C. After 30 
minutes, 10 //L of reaction mixture was quenched with 10 /^L of gel loading buffer 
(10% formamide, 0.2 5 7o bromophenol blue, and 0.25% xylene cyanol FF). 10 /jl of 
this solution was then run on a 20% sequencing gel to visualize both protection and 
degradation of the 23/30-mer DNA as a function of time. Product formation was 
quantitated by measuring the ratio of uncieaved substrate and cleaved product- The 
ratios of substrate protection are corrected for substrate in the absence of CcrM. 
Corrected ratios are then multipHed by the concentration of total DNA used in each 
assay to yield the amount of DNA protected. 

Enzymatic assays were also performed using plasmid pUC18 DNA 
substrate under similar reaction conditions described above. Reaction products using 
the larger pUC18 substrate were resolved by agarose gel electrophoresis (1% agarose 
gels). Cleaved and uncieaved DNA are easily visualized under ultraviolet light after 
staining the gel with 0.5 ;yg/mL of ethidium bromide. Quantitation of the reaction 
products for kinetic analysis were performed by densitometry measurements. 

A second method involves direct measurement of the incorporation of 
[^H1-CH3 from [^H]-SAM into DNA. A typical assay consists of 250 nM CcrM, 5 //M 
DNA (hemi- or unmethylated) and 6 fjM [^H]-SAM in the appropriate reaction buffer. 
5 jjL aliquots of the reaction are quenched in solution containing 500 /jL 10% 
perchloric acid, 200 /yL saturated potassium pyrophosphate, and 20 //L 1 mg/mL 
single-stranded DNA at times ranging from 15 seconds to 30 minutes. The quenched 
samples are placed on ice for 30 minutes to precipitate all DNA. The precipitated 
DNA is then recovered by filtration using glass fiber filters and washed, first with 
cold 0.1 N HC! (five times with 1.5 mL) and then with cold 95%.ethanol (four times 
with 1.5 mL). The filters are then dried at 90°C for 10 minutes and counted by 
standard liquid scintillation techniques. The specific activity of the reaction is 
determined by measuring the counts per minute present in a fixed quantity of the 
original reaction in the absence of washing. 

Specific activity (SA) was determined by measuring the CPMs present in 
5 jjL of original reaction. SA = CPMs/pmol SAM. The amount of methyl 
incorporation was determined as follows: 
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rero' 

— = pmol product 

Specific Activity 

The amount of methyl incorporation into the DNA substrate is determined by 
dividing the counts per minute of the washed reaction samples by the specific 
activity of the total reaction mixture. This yields product formation in terms of mole 
quantities. All data are corrected for nonspecific binding of [^H]-SAM to the washed 
filter. 

Alternatively, following the enzymatic incorporation of {^HJ-CHj from 
[^H>SAM into DNA, a 5 pi aliquot of the reaaion is spotted at variable times onto 
DES anion-exchange filter paper. The filters are then washed 3 times for 10 minutes 
with 200 mL 0.3 M ammonium formate, pH 8 to remove unreacted [^H]-SAM. The 
filters are then briefly washed twice with 95% ethanol and then washed once with 
anhydrous ether. The filters are then air dried and counted by standard liquid 
scintillation techniques. The specific activity of the reaction is determined by 
measuring the radioactivity present in 5 f/l of the reaction spotted on glass filter fibers 
without washing. The amount of methyl incorporation into the DNA substrate is 
determined by diving the counts per minute of the washed samples by the specific 
activity of the total reaction mixture, yielding product formation in terms of pmol 
quantities, all data are corrected for nonspecific binding of [^H]-SAM to the washed 
filter. 

During the course of performing the above assays, it was observed that: 
the N^-23/30-mer NM5/50-mer, and the N^-60/66-mer are preferred substrates by 
ratios of 10:1 and 2:1; the tested methyttransferases are processive under the assay 
conditions used; optimal activity was at 30° C rather than 37^ C; and the tested 
enzymes are DNA-dependent (/.e., they become inactivated in the solutions used 
after about 20 minutes in the absence of DNA substrate). The Toss of activity in the 
absence of a substrate does not appear to involve proteolytic degradation. 

c. In vivo assay 

A single colony of BL21(DE3) or DH5a hosting pCS255b was used to 
inoculate a 5 mL SB/amp (200 A/g/ml) overnight culture at 37^C. The BL2I(DE3) 
culture was divided into two aiiquots at OD^qq-I. One aliquot was induced with 1 
mM IPTG at 37°C overnight while the other was allowed to grow without 
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induction. Cell cultures were centrifuged, from which cell pellets were subjected to 
mini plasmid prep. The recovered piasmids from DH5a and BL2I(DE3) (with and 
without IPTG induction) were digested with Hinfl and the restriction digests were 
analyzed by 1% agarose gels. In all cases, controls containing the undigested 
plasmid were included. Plasmid recovered from DH5a was susceptible to Hinfl 
digestion while piasmids from BL2I {DE3) with and without induction were resistant 
to Hinfl digestion. It appears that even uninduced BL21(DE3) expresses ccrM. To 
ascertain that BL21(DE3) did not have intrinsic methyltransferase specific for the 
GANTC sites, pUCIS was introduced into BL21(DE3). pUClS recovered from 
BL21(DE3) was susceptible to Hinfl digestion, thereby excluding the possibility of 
BL21{DE3) host cells containing intrinsic M. Hinfl methyltransferase activity. 

Although the foregoing invention has been described in some detail by 
way of illustration and example for purposes of clarity of understanding, it will be 
obvious that certain changes and modifications may be practiced within the scope of 
the appended claims. All publications, patents and patent applications mentioned in 
this specification are hereby incorporated by reference for all purposes, to the same 
extent as if each individual publication, patent or patent application had been 
specifically and individually indicated to be incorporated by reference. 
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TABLE 1 

Alignment of the AgTob^ccezrium tuznefacierLS (At ) , 
Brua&lla, aJDortus (Ba) Rhizobium mBlHotzl (Hm) , 
CAulobacter crescBzitus (Cc) and Helicobacter- pylori (Hp) 
CcrM DNA methyl transferase homologs 

At IFADPPYNLQZ/XNVHRP 

Ba M5LViaJU{EZjPI£AFRTAWU3SIIlCGDCVSAZ«£ia^XmSVI^ 

RA MSSWSIJUEIXSRAARPUWl^SXXKGDiCVAAXiNAL^^ 

Ce MRFCPCTX XHGDCXEQMNALPEIC5VDLIFADPFYNI«Q£JSGDIXRP 

hp MDFIiKENXiNTXXEGDCLEXIJCDPPNKSVDrXFWPI^ 

At DQSX«VDAVTOEWDQFASFX3AYIX^FTItAWUi;^^ 

Ba I^SMVSAVDDKWDQFESFQAYDAFTXtAWLIJUZRRVLI^^ 

Ite XDQSLVDAVDDDWX)QFASFEAYDAPTRMia^^ 

Cc DNSKVDAVDDHWDQFBSPAAYOKFTHEHIiI^^ 

Hp EGTKFQCSsm>amKFGSFEEYim 
, . ♦ 

At LDFmUi 

Ba LCFWIXNDrVWRKTNPMPNFRGRRFONAHETLXWASREgXCIXryT^ 

Rm UtrWVLNDX IWRKTQPDAKI.QCSUlFQNAHBTLXWATANAKAKCYTFMyEftKK^^ 

Cc IXSPWXLNDr^^WRKSmiMPNFKCTRFMIAHST^ 

Hp LGFWXLNDXVmiCSNPVPNFAGKRLCBIA^ 

* .IS * * : i • *22 

Ba RSDWIJF'PICTQSEMJmElKroKVHPTQKPEAI^IJ^IMMAS 
Bn RSDWUPPXCSCSERIjKODDCKKVHPTQKPEM-IJtfllX^^ 
CC RSPWrXPI^CTCEERXKCaUXSQICAHPTO 

Hp KSVWQXPICMCNERliKDAQCKKVKSTQKPEALIMCIILSATKPro 

* '.♦♦i*. 2 srixjs:** ««*w* . 



Ba AKRLCRHFVGIEREQPYIDAATARIiaVEPLCKAELTVMTOKI^^ 
Kin AKRLGWiFVGXEHEQDYXDAAAERIAAVEPLGKATLSVMTK^^ 
Cc AKRI/SRKFIGIEREAEXI-EHAKARXAKVVPXAPEDU^^ 

Hp AKSMNRY FICIEKDSFY IKEAAKRLNSTRDKS -DFXTNIiDIiETKPPKIPMSIiI-ISKgliLK 

• *j - . s ; i •:s-r *: 

Ba PGTVLCDERRRFAAIVRAIXSTLTAN-CEAGSIHRICARVQGFnACI^^ 

Rm PCT^n[*TDAXRRYSAXVRAIxmJ^G-CEACSXHRLGAKVQCU»C^ 

Cc PGDTI.YCSKGTHVAKVRPDGSXTVC-DLSCSXHKIGAI*VQSAPAaaCW1^^ 

Hp iGDFLYSSimEKXCQVLENCQVRDNEMYETSIHKMSAKYIiNI^^ 

Ba PXDALRXXXKEQMAAAGA 
Bm PXDELRSVXRNDLAXLN 
Cc FXXarVXiBAQVRAGMII 
Hp £iLDELRYXCQRDS 

Note: ♦ indicates the identical residue is present in all rive sequences 
: or - indicates the axDino acid at that position is conserved in all 
sequences - 
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1. An isolated nucleic acid encoding a methyltransferase wherein 
said methyltransferase has a molecular weight of about 30-45 kilodaltons and binds 
to a polyclonal antibody that specifically binds to a polypeptide from the group of 
polypeptides having SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and SEQ ID NO:8.. 

2. An isolated nucleic acid according to Claim 1 that encodes a 
Rhizobium melUoti DNA methyltransferase (SEQ ID NO:2). 

3. A nucleic acid of Claim 2, wherein the nucleic acid comprises 
SEQ ID NO:l. 

4. A nucleic acid of Claim 2 contained in a genetically engineered 

cell. 

5. An isolated protein encoding a methyltransferase wherein said 
methyltransferase has a molecular weight of about 30-45 kilodaltons and binds to a 
polyclonal antibody that specifically binds to a polypeptide from the group of 
polypeptides having SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and SEQ ID NO:8. 

6. An isolated DNA adenine methyltransferase of claim 5 wherein 
said methyltransferase has the amino acid sequence provided in SEQ ID NO:2. 

7. An isolated nucleic acid according to Claim 1 that encodes a 
Brucella abortus DNA methyltransferase (SEQ ID NO:4). 

8. An isolated nucleic acid of Claim 7, wherein the nucleic acid 
comprises SEQ ID NO:3. 



9. A nucleic acid of Claim 7 contained in a genetically engineered 

cell. 
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10. An isolated DNA adenine methyl transferase of claim 5 having 
SEQ ID NO:4. 

11. An isolated nucleic acid according to Claim 1 that encodes an 
Agrobacterium tumefaciens DNA methyltransferase comprising SEQ ID NO:6. 

12. An isolated nucleic acid of Claim 11, wherein the nucleic acid 
comprises SEQ ID NO:5. 

13. A nucleic acid of claim 1 1 contained in a genetically engineered 

cell. 



14. An isolated DNA adenine methyltransferase of claim 5 wherein 
said methyltransferase has the amino acid sequence provided in SEQ ID NO: 6. 

15. An isolated nucleic acid according to Claim 1 that encodes a 
Helicobacter pylori DNA methyltransferase (SEQ ID NO:8). 

16. An isolated nuclieic acid of Claim 15, wherein the nucleic acid 
comprises SEQ ID NO: 7. 

17. A nucleic acid of claim 15 contained in a genetically engineered 

cell. 



18. An isolated DNA adenine methyltransferase having SEQ ID 

NO:8. 

19. A nucleic acid of claim 1 that encodes a processive 
methyltransferase that methylates a first site in a DNA substrate and then a second 
site in the DNA substrate without dissociating from the DNA substrate between the 
time of methylation of the first site and the time of methylation of the second site. 
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20. An efficient assay for methyltransferase activity, comprising the 

steps of: 

a) contacting a processive methyltransferase with 

1) a substrate selected from the group consisting of: 
CH3 

5 ' atcctctcgcg* agtcaacagaaa 

y aggagagcgc tcagttgtctttataggcgc; 

CH3 

5' atcctctcgcg*agtcaacagaaatatccgctcatcaccgcaagtt 

3' aggagagcgc tcagttgtctttataggcgagtagtggcgttcaaaaggca; and 

CH3 

5 ' atcctacgcg* agtcaacagaaatatccgcgagtcaccgcaagttttccgti tgaccggc 

3' aggagagcgc tcagttgtctttataggcgctcagtggcgttcaaaaggcaaactggccgtgggagg; and 

b) further contacting said processive methyltransferase with a methyl 
donor prior to or at the same time as the addition of the DNA 
substrate, 

wherein the methyltransferase methylates the DNA substrate. 

21. An assay according to Claim 20, wherein the methyl donor is S- 
adenosyi methionine. 

22. An assay according to Claim 20, wherein the assay is performed 
at 30*^ C or 3^ C. 

23. An assay according to Claim 20, wherein the assay is performed 
in the presence of 1 50 mM potassium acetate. 

24. An assay for screening for inhibitors of DNA methyltransferase 

activity that comprises: 

i. contacting in an aqueous reaction mixture a nucleic acid encoding a 
DNA methyltransferase wherein said methyltransferase has a molecular weight of 
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about 30-45 kilodaltons and binds to a polyclonal antibody that specifically binds to 
a polypeptide from the group of polypeptides having SEQ ID NO:2, SEQ ID NO:4, 
SEQ ID NO:6, and SEQ ID NO:8 with an antisense agent that inhibits the expression 
of the methyltransferase; and, 

ii. detecting the level of inhibition relative to a control reaction 
mixture that is substantially identical to the reaaion mixture of step i except that the 
antisense agent is not present in an amount effective to inhibit the expression of the 
methyltransferase. 

25. A method of claim 24 wherein the antisense agent is a ribozyme. 

26. A method of claim 24 wherein the reaction mixture is within a 

host cell. 

27. A method of claim 24 wherein the antisense agent is exogenously 
added to the reaction mixture. 

28. A method for assaying for inhibitors of DNA methyltransferase 
activity comprising the steps of: 

i. contacting a first aqueous reaction mixture containing a DNA 
methyltransferase wherein said methyltransferase has a molecular weight of about 30- 
45 kilodaltons and binds to a polyclonal antibody that specifically binds to a 
polypeptide from the group of polypeptides having SEQ ID NO:2, SEQ ID NO:4, 
SEQ ID NO:6, and SEQ ID NO:8 with an agent that inhibits the biological activity of 
the methyltransferase; and, 

ii. detecting the level of inhibition relative to a control reaction 
mixture that is substantially identical to the reaction mixture of 5Jep i except that the 
inhibitory agent is not present in an amount effective to inhibit the expression of the 
methyltransferase. 

29. A method of claim 28 wherein the DNA methyltransferase is not 
contained within a living cell. 
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30. A method of claim 28 wherein the DNA methyltransferase is from 
Brucella abortus. 

31. A method of claim 28 wherein more than one agent is tested at 
the same time. 

32. An assay for detecting antibiotics that target processive adenine 
methyltransferases, comprising: 

a) contacting a methyltransferase with a methyltransferase substrate in 
the presence and absence of a test substance; and 

b) detecting the enzymatic activity of the methyltransferase in the 
presence and absence of the test substance. 



33. An assay according to Claim 32 wherein the enzymatic activity 
detected in the assay is a processive activity. 



wo 98/12206 PCT/US97/16593 

1/11 

Fig. I, Rhlgobiua mellloci mechyltransf er act ^cha cequence 
mk saqusncs 1(98 b.p. GCACICXtCCO: ... QXJJl'iT f'.ar ^ T 1 Inaur 

1/1 31/11 

OCX GTC COS CCC TGG OCT OCX ACC TCC GIC CTT C7IC ACC CCC TCS OCC CCC A7C ACA 

n/21 »/3i 

GCC CCC AGC A3G TIT GCG CST CCC CCA TCC GGC ATC AAC OQ CCC ATC ACA CCT AlC ICC 
121/41 151/51 

GGC CCr TCC TTC AXA CTt OCX tCX TAA TCC AAC TAT CCC CCA OCC CCA ACA CCC CCA TGC 
ISl/Sl 2X1/71 

GCC CCC CCT GGX COX TCA CTC CZG CCC OCX GSC AAA TTT TTC CCC OCC CZT CAG GCT TIG 



241/81 271/31 

GXA AOC ATC TZC GCT AAC CAT AAC CCT ATC CXC ACT CCC ACT AAG OTT ATT TCC CAC TTC 

331/111 

ba Alt: TCX tca ctt ctt tog err ccc caa atc tec cctt goc ccc cct occ cig aac tcc 

3C1/121 391/Ul 

CXG CAC ACC ATC ATC AAG GGX GAT TGC GTC GOC GCG CIC AAC OCC CXT CCC GAT CAT TCC 
421/141 451/lSl 

GIC CXT CXC ore TTC GCC CAC GOC CCC TXT AAT CTT CAC CXC GGC GGC AOC TTC OC CCC 



4Bl/lffl 511/171 

COC GAT CAG TCC CTC GTC GAT GCX GTG CXC CAC GAT TGC CAC CXC TXT GOT TCC TIC GXX 
S41/181 571/151 

GCC IXC CXC CCr TTC ACC GGC GOC TGG OS CXT GCC TGC OCG OCT GXC CXG AAG CCC AOC 
WOl/201 €31/211 

CCC ACS CIC TCC CTC ATC GCT TCC TAC CAC AAT ATC TTC OCC CIC GCC COS ATC CIC CAG 
(Cl/221 €91/231 

GAC CrC CAC TTC TGG CTC TTC AAC CXT ATC ATC TQ5 CCC AAG AOC OkA CCC GAT GOC CAA 
721/241 751/251 

CXT CAA CCC CCC CCC TIC CAC AAC CCC CAT (SUl ACC CXG ATC TGG GCC ACC GCG AAC GCC 
781/261 811/271 

AAG CCC AAG GCT TAT AOC TTC AAC T:IC GAA GOC ATO AAG GCG GOG AAC CAC CXC CTT CAG 
341/281 671/251 

ATC CCC TCC GAC TGG CtC TTC COC ATC TGC TCC CCT TCC GAG OCC CXG AAG GGC GAC CAC 
9Q1/301 931/311 

GGC AAG AAA GTA CXC CCC ACC CAA AAG OCC GAA COG CXG CTT GOC CCC ATC CZG ATG GOC 
981/321 991/331 

TCC ACC AAG GCC GGG GXC GIC GTC CTT GAT COC TTC TTC GGC TCC GCC AOC ACC GGC COC 
1021/341 1051/351 

GTC GCC AAC OCC CIC GGC COG SC TTC GTC GCC ATC GJU3 OGC CAG CAG GAC TXT ATC GAT 
1081/351 1111/371 

GOC GCC COC CAA CCT ATC GOC GOC GIC CAG OCC CIC GCC AAG OCC AOC CX TCG GTC ATO 



1141/381 1171/391 

AOC GCC AAG AAC COC GIG OQG CCC CTC CCC TIC AAC ACT CXG CTIG GXX ACC GGG CXC ATC 
1201/401 1331/411 

AAG COC G3C ACC CTT CIC ACC CXT CCC AAC CCC CCC TM AGC CCC ATC GTC COC CCC GAC 
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Figure 1 (cont.) 



GOZ ACS 



cTOGCGTCccsGCccctacccraaiTOvrTCXca^ 



13217441 13S1/4S1 

coc cic »c cazrccij^ctxc'izsixocrrzTcac.^cm 

13ai/461 1411/471 
CCS AOC G^C 



C cac CIC AGA TCC GTC ATT COL AAC GAC CIC CCA AAA dC AAC TCA TCA AOC 



1441/4tfl 1471/491 

AOTta:c«TO5CT=TicGATAa=ccKcn:crrcffi 

1501/SOl 1S21/S11 
Gy«;CGCTrrAAACGCaWAATOTAACAGGAITCn:C^ 

15ei/S21 1591/S31 

cacaarT3u:A«ATOGcaGa:GcroccAa5«WGrocMOT 

ie21/541 1^51/551 

GCC GCC ACA TCS CCT TIC ACC CITTCa CXSJ CCS GIC AAC TOX ACC OCC TOC CAC CCS 

U0i/sn 

OCT CCS CGC err cca cat 
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Fig. 2 Rhizobium meliloti methylcransf ^rase peptide sequence 



HS 5VV5Z«XC 
DSX2XGDCV 
OVV FJWDPPV' 
OQSliVOXVO 
YDXrTKAWX. 
Ti:.WVXGSYK 

QG RKrjLUXH 

RSOWUFFXC 

KXVHPTOJ^ 
TICPGBVVLO 

XKRI*CRKFV 
AAERIAAVB 
OXKAEPRVX 
PGTVX*TDXX 

TLAsaarxa 

LDXCNGWTF 
IDELRSVIR 



ISRXXRPI.MWL 

XXLHXX.FDHSV 

KI.ai»GCTX*HRP 

ODWB aFXSFEA 

tXCRRVLXPTC 

K2FRVGXII*QD 

IWRKTaPDXBU 

BTUIWXTXKXK 

XMKXXMODVJIM 

5GSERX.XSOOO 

EXl.LXRXr.MXS 

prFCSGTTCXV 

GIERCaDYXDX 

PLGXXTLSVMT 

PHTLVESCLXK 

RRYSXIV-RXDG 

StKRl-OXXVaG 
WKFEEGSVLKP 

N D L X X L U 
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PCT/US97/ld593 



fig^ 3 Brucella aborcus methyl transferase gene sequence 
Oa sequence 1731 b.p. A^W3GCX«3»A ... a-TCWTOAAfiX linear 

1/1 

MA CCS cax cm cur ccr cm Am tcc att ato acc caa ctc gcc cm att cat tot 

61/21 51/Sl 

cm ACT ACX OOC GAC ACS CAC TIC ACT CTC QSA COS OCT GCC ACA CIC ACT GCX TCA TOC 
121/41 lSl/51 

TOl TIT CCC GCC GGA TCA TAG AOC AXA ACA -MT AAC CM GCC TEA TIG ATT GCC ACA TAT 
161/^1 211/71 

CCC GTT CCA CCC TTC CAC XTG GAT OUC CTC CTC ACC ATO ACA ACT OCA TM TTA TCT CTC 
241/81 271/Sl 

CCT TAT TC5C GG5 CCC AAA GCC CCC AM GCC GGG CTT TCC CTB TCA TAT laA GM AA6 MT 



301/101 331/111 

13UC CAT TIC AM5 CAC TTC GCC TTA ACC GCA TAT TTA CCC TAC GCA CIA ACC ATA GGA ACA 
3C1/121 3S3*iM 

ACT TTT Tits OCT TCA CAG CTA ATC GAG "DIT CDC ATC TCC CTA CIA OCT CTT GCC CAT GAG 
421/141 451/lSl 

TTB CCC ATC CAC GCC CCC OCT AGC GCC TCG dC CAC TCC ATC ATC AM CCT CAT TCC CTT 
48iyiffl Sll/171 

TCC GCC CTC GAG CCC CTC CDS GAT CAT TCC CTA GAC CTC ASC TOT GCC GAT COG CCC T3VT 
541/181 571/m 

AAT CrC CAC CTT GGC GCC GAT CIC CAC CCT OOC GAT CAC TCC ATC GTC ACC CCC GTG GAC 

Wl/201 431/211 

GAT CAT TCG GAC CAC TTT GM AGC TIC CAC GCC TAT GAC GCC TTC ADC COC CCC TCC CTC 

4SI/221 m/231 

CIC CCC TCC COC CGT GTO CIC AAC CCC AAT GGC ADC AlC TCG GIC ATC CCT TCC tAT CAC 
721/241 751/251 

AAT ATT TTC CCC CTC GGC AOS CAC TTC CAC GAT CIC GCC TTC TCG CIC CIC AAC CAC ATT 
781/261 81X/271 

CTC TCC GCC AAG ACC AAT CCC ATC CCD. MT TTC OCT GGC CCC CCT TTC CAG MT GCC CAT 
341/281 W91/291 

CM AOC CIC ATC TGG GCT TOG CCT GAC CAC AAG GGC AAG GGA TAT ACT TTC MT TAC GAC 
901/301 931/311 

COC ATC AAA GOG GCC MT GAC GAT GTC CAG ATC CCT TCC GAC TGG CIC TIC CCC ATC TCC 
941/321 991/331 

ACC GGC ACT GM OCC CTC AAC CAC CAG AAC GGC GAC AAG GTC CAC CCC AOC CAC AAC CCC 
1021/341 10^1/ JdX 

GM GCA err or gcc coc ato ato ato ccr tca aoc aac ccg ggc gac err att ctc gac 

1081/351 IXU/X/l 

OCA TTC TTC CGT TCC GGC AOC AOC GCC OOC CTC GCC AAG OGG CXT GGC CGC OIT TTC GTC 
1141/3S1 U71/391 

GGC ATC CAG OCT CM CAC CCC TAT ATC GAC GDC GCA ACC CGC CCC ATC AAT GCC CTC GAG 



1201/401 U31/411 

OCC CTT CCC AAG GCC CM CTC AOC CTC ATC ACC CGZ MG CCC GCA GAC OCC QSC GTC GCC 



wo 98/12206 PCT/US97/16593 

5/11 

Figure 3 (cont.) 

X2C1/421 12S1/431 

TIC XCC ACC CTX ATC GWl OCO CGC CTT TTC OTr COC 0» ACC aTB CTT TCT CAT »A COC 
I32Xy44I 135.W »S1 

srr TIT coc cac att crrr cec ccc cxt ogc aos cic aoc gcc aac csc caa occ ccrr 



TOW ATC CAT CCT ATT CCC GQC AOC CTT CAA GCC TTC CAT CCC TCC AAT OCC TCC ACC TTC 



1441/481 1471/491 

TOC CAC TTT GAG GAA AAC GCC CTA CIC AAA OCT A3C GAT COC CTO CSC AAC ATC ATC CCC 
1501/501 

GAA CAC ATC CCT CCC CCA OCT GCA' TAA. GAA ACT TBL ATA TOC GAC CRT CIC CAC ^^A. ACT 

15^1/521 issi/sai 

CrC ATA CCA ACC CCC TCC AAC TTT TOl AAC TIC COC CCC CTT CAT TOT TIC AGA AAG AAA 
1K1/S41 WSl/551 

OCT CTC OCC COC CCA AAT OCT COC OCA CTT TOC CIC CCC TCC TAA AAT OCA CCC CCT CCC 
1681/561 1711/571 

AOC CCC CIT CCT TCC CAC CTT OCA CAT TCT CCA TOC TCT QT CCA TCA AGA 
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Fig. 4 Brucella abortus nechylcransf erase peptide sequence 

MSLVRLAKE 
421/141 <S1/1S1 

L?IEAPRTAWLOSltKCDCV 
481/lSl 5117171 ■ 

SALERLPDSSVDVIFADPPY 
541/iai 

MLQLCCOLHRPDQSMVSAVD 
COl/201 631/211 

!)HtfDQrESFQAirDArTRAWL 
«<l/22l »1/231 

1.A CR avLXPNCTIWVXeSYH 
721/241 7SI/2S1 

jlIFRVCTOLQDLCrWLLMDI 
781/2Ct «ll/27l 

V'JRK TKPMPMrRCR***Q**AK 
341/381 B? 1/291 

STLIWASRrQKCKCY TFMYE 
901/301 931/311 

AMKAANDDVQMRSDWLFPIC 
•Cl/321 »l/33l 

TCSERLKDEHCDXVHPTQKP 
1021/341 1051/251 

SALC.ARIHMA5 S X P C D V I L 0 
X0«l/3«l llU/371 

PPrC sCTTaAVAKRI.CRKFV 
1141/3S1 1171/391 

crEREQPYlDAATARIMAVE 
1201/401 1231/411 

?LCirAEl.TVMTCKRXEPRVA 
12S1/421 1251/431 

FTSV MEACLLRPfiTVLCDER 
1321/441 135... «31 

RRFAAXVRADCTLTAHGEAG 
13B1/4C1 1411/471 

SIHR rCARVQGFOACNCWTF 
1441/481 1471/491 

WKFE EMGVCXPZOALRXXZR 
ISOX/SOl 

EQHAAACA 
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Fig. 5 ^fl^^rr p^ium cumefaciens mechyltraasf erase gene sequence 
ONX sequence 2SS b.p. A3Trio=a33iT TOSiTCCTCAAC linear 

1/1 

ATT TIC QCC CAT CCC CCG TAT AAT CIC OUS CIT CGZ CGC AAC GTO ac CSC OOC CAT CAS 



a/21 ^^^31 

tec CTC etc (SAT CCC CTTGATGACGAATOSGACCACTTCCCCTOCTTCCaCCOCTATGAC 
121/41 ISXJSl 

COC ITC ADC COC CCC TOC CTC CIC CCC TOC OOC CCT GTS CIC AAA CCC AAC CCC ADC A3C 
181/61 211/71 

TOC CTC ATC GGC TCC TXT CAC AAT ATC TTC CSC GIC CCC CCC AIC CIC CAC AAC CTO GAT 
241/81 

TTC T5C ATC CIC AAC 
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Fig. 6 Agrobac cerium tumefaclens mechylcransferase peptide sequence 



1/1 

TFXDPPyNLOr<GCNVHRPDQ 
a/21 »'31 

SLVDAVDDEWDQr^SrDAYD 
121/41 "I'Sl 

XF TRAWLLACRRVLKPHOTX 
lai/Sl 211/71 

WV icSYHMlPRVCXMLffMLO 
241/Sl 

r w 1 I. N 



W098/12»I6 



9/11 



PCT/USS»7yi6593 



FIG. 7 

Sequence ot Helicobaeter pylori CcrM hewnolog and pucaeive rascrieeion endonucleaee . 
Boxes around 'ATC' indicate staxt codons- circled nucleotides CTAA') indicate a 
stop eeden. The start codon of the downstream open reading frame overlaps the stop 
codon of the gene encoding the CcrM homolog. 

1 31 

AAC GQG CAT CCT TTCS CCA TTT CCA TTT GAA CCC ATC GGO CAC TTA TSC CTT TTT GTT OXA 

61 . 91 . - 

ICO TTT AAA ATA GOT GGO GAT AGO TAG CTT CTA TCA TTT CAT GCA TTT SAT GAG AAC AAA 
121 ISI 

OCT AGC CAC TAA ACA TTA ACA TAG CCT TAA AAC OCT TOT GTT AAA ATG CCC AGA GTA OCA 
IBl 2" 

OAT ATA AAA GGC TAG TTA ATC ATG GAT TTT TTA AAA GAA AAC TTA AAC ACT ATC ATA GAG 
241 211 

GOG GAT TST TTA GAA AAA TTG AAA CAT TTT CCT AAT AAA AGC CTT GAT TTT ATC TTT OCT 

301 "1 

OAC CCC CCA TAT TTT ATG CAA ACA GAG CGA GAA TIC AAG COT TTT CAA OGC ACA AAA TPT 

CAA GGC GTT GAG GAT CAT TOG GAT AAA TTT CCC TCT TTT CAA CAA TAC GAT ACC TTT TGT 
421 «S1 

TIC CCT TOO TTA AAA GAA TGC CAA ACC ATT TTA AAA GAT AAT CCC AGT ATT TCT CTO ATA 
iSl 

GGC AGT TTT CAA AAT ATT TTT ACA ATT OCT TTT CAT TTC CAA AAT TTA CCC TTT TCG ATA 
541 571 

CTC AAT GAT ATT GTT TGG TAC AAC AGC AAT CCG GTG CCT AAT TTT GCT GGC AAG AGA CTA 
SOI 

TOC AAC CCC CAT GAA ACC CTT ATT TCG TCC CCT AAA CAC AAA AAC AAC AAA GTT ACC TTT 
641 "1 



AAT TAT AAA ACA ATC AAG TAC CTC AAT AAC AAT AAA CAA GAA AAA TCG GTT TCC CAA ATC 
731 '51 

CCT ATT TOC ATG GGT AAC GAA AGC CTA AAA CAC CCC CAA CCT AAA. AAA GTG CAT TCC ACG 
781 "1 

CAA AAA CCA GAA CCC CTC TTA AAA AAA ATC ATT TTA ACC CCC ACT AAA CCT AAA CAC ATT 
841 871 

ATT TTA GAT CCC TTT TTT CCC ACA CCC -ACA ACA CGG GCT CTC GCT AAA TCC ATC AAC AGC 
901 931 

TAT TIT ATT CCC ATT CAA AAA GAT TCT TTT TAT ATC AAA GAA CCG GCA AAA CCC CTT AAT 
961 991 

ACC ACT ACG GAT AAA AGC GAT TTT ATC ACT AAT TTA CAT TTA GAA ACT AAA CCC CCA AAA 
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FIG. 7 (Cont.) 

,-21 lOSl 

ATC CCT ATO AGT CTT TTA ATT TCT AAA CAA TTA CTC AAA ATT GGA CAT TIT TTA TAC TCA 



i^^AAC AAA GAA AAA ATT TCT CAA CTTT TTA G^'aAC GOA CAA CTC ACW t=AT AAT GAA AAC 



nil 



1141 1"1 
TAT 



GAA ACT TCT ATT CAT AAG ATC ACC CCT AAA TAT rrC AAT AAA ACT AAC CAT AAT OGC 



1201 "31 

TCC AAA TTT TCT TAT GCO TAT TAC CAA AAT CAA TTT TTA TTO TTA GAT SAA TTC CGT TAT 

Sc TGC CAA ACC «=AC TCT TAA TC3C ACT ATC AAA CCT TTA ACQ AGA TTP TTA ATC GTT TTC 
1321 "51 

TTT TTC tfAA CAT CTA AAC CAA AAT TAC TIC AAA ATA TTG CCC AAA ATC CTC AAC OCT ATT 
13 Bl 

TGC CCA TIT TTA GAC CCA CTA AGC CTA AGA CAA AAC TAT TAC AAA ATT TAT TCA CTT CTC 
1441 

ATC AGA TTA ACT TTG CCG ATC CCT TTG AAT CCT TAA TAG AAC AAT ATT TAA AAC ACC ATA 

JSt TIT CAC CTT TAT CTA AAA AAA TTC CTT ATT ACA ATA ACC ATA AAG AAA AAA GCC AAT 
iSfil 1591 

CTT TAG AAT TAG ATC ACT TTO CTA AAA AAO ATA ACA CAT ATT ATT TTA TW» AAC AAA AAA 

tS GAG AlGACCATOACACCACCAAAAAGAGACCGCAAATAeATAACTTTSAAAGOAAAT 

ACW CTT TAC TCC ATC GTT ATC CCG AAA ACA TTC AAC CCT ATT TTT ATT TTA TAG ATC 

CTT TCA ATA AAA ATC AAA ATT ACT ATA AAG AAC AAT TGC AAA AAT TAT CTC TTC ATT 
lani 1831 

ATO GCC TCC CTT TCA GTT TGT GTT ATC CTA AGG GGT TGT TTG AAT CTC TTA ATA TCC CCC 

laci 1891 

TTT GGC ATC AGG TTT TAA GCC ATT TAC TGC GAT GCC CTC AAA CCT TAC CCC ATT TAC 



1951 

CCA GTT TCA ATT TTC ATC AAA ATC CTT TAG AAA 



1921 GAG AAA TCA AAC ATT TAG CGC 



SI^GCC TTT ATA CCA AGC TTT TGC ATA ATG Ik^M tTT TCA ATC TTC TCT TAA TTT TAT 

CAC AAC AAA AAG TTT TAA AAA TOT TAG TAC AGC ATT TTA CAC AAC AAA AT (ine««plat.> 



PCT/US97/16593 

WO 98/12206 11/11 
Amxno acid sequence of H-licob-^ter P^^''","'"^^^^*^ 

•me numbero above the amino acid i^oience^ amino acid number in the protein. 

Nucleotide number in the *f f«f f^r^iJiSTeoSIsp^^'^- " nuel«.tid... Tha 



Hence, each line contains 20 
asterisk denotes the position of the stop 

202/1 ^ M r. M T I^'l^ bGOCLEKL 

MDFI.KEMt.NTA.1 

292/31 

262/21 „«p.TFADPPVFMa 

352/Sl 

T E C C I' 

412/71 

r^'" paSFEEYDTPC LGWLKEC 

472/91 

442/81 ^eTCVlOSPQNlF 

532/111 

= F H I. 0 » = F W I L « D I V W Y 

592/131 

^'"'m PVPNFACKRI.CNAHETL 

?^'i*'c A K H K M N K v"T'f W V K T M K V 

712/171 „ _ 

682/161 SVWQIPI«=**®"^ 

772/191 .. , - 

.832/211 - - ^ 

I I. S A T K P K D I 1 L D P P P C 

892/231 „ 
r"o"T X = A V A X S M « R V P I <= I - " 

952/251 ^ „ e n 

|"T'f r 1 k e a ^ »^ « ^ = ° 

1012/271 , ^ _ 

982/261 - ^ P P K 1 P M S t L 1 

1072/291 „ T r- 

1042/281 KIGDPI'**^*''^^ 

IU2/311 » t. It 

E K O Q V K » H ■= " » « ' = ' " " 

i""s"'A K Y I. N K T H H N G W K P P V A V 

12S2/3S1 « n s * 

1222/341 rDEI.R^I = Q*^° 
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Older for more than one species to be seaiched, the appropriate additionai seaich fees must be paid. The species are as 
fi>llows: 

SEQ ID NO:2 encoding nucleic acid. Rhizobium meliloti methyltransferase peptide sequence 

SEQ ID NO:4 encoding nucleic acid, Bnicella abortus methyltransferase peptide sequence 

SEQ ID NO:6 encoding nucleic acid. Agrobacterium nimefaciens methyltransferase peptide sequence 

SEQ ID NO:8 encoding nucleic acid. Helicobacter pylori methyltransferase peptide sequence 

The claims are deemed to correspond to the species listed above in the following manner. 

SEQ ID NO:2 encoding nucleic acid - claims 2-4 (Group I) 

SEQ ID NO:4 encoding nucleic acid - claims 7-9 (Group 11) 

SEQ ID NO:6 encoding nucleic acid - claims 1 1-13 (Group III) 

SEQ ID NO:S encoding nucleic acid - claims 15*17 (Group IV) 

The following claims are generic: 1,19,24-27 

The species listed above do not relate to a single inventive concept under PCT Rule 13.1 because, under PCT Rule 13-2, 
the species lack the same or correspond'mg special technical features for the following reasons: The disclose sequences 
have diOercnt sequences and molecular size. The particular methyltrsnsferase activity is known in the art. 

This application contains claims directed to more than one species of the generic invention. These species are deemed U 
lack Unity of Invention because ihey are not so linked as to form a single inventive concept under PCT Rule 13.1. In 
order for more than one species to be searehed. the appropriate additional seareh fees must be paid. The species are as 
follows: 

SEQ ID NO:2 encoding peptide - claim 6 (Group V) 

SEQ ID NO:4 encoding peptide - claims 10.30 J 1 (Group VI) 
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SEQ ID NO:6 encoding peptide - claims 14 (Group VII) 
SEQ ID NO:8 encoding peptide - claim 18 (Group VIII) 
The following claims aie generic: 5^S^ 

The species listed above.do not felate to a single inventive concept under PCT Rule 13.1 becsuse, under PCT Rule 13.2» 
the species lack the same or conespcmding special technical featuies for the following reasons: The disclose sequences 
have difTeieat sequences and molecular size. The particular methyltransfersse activity is known in the art 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. la order for all inventions to be searched, the appropriate additional search fees 
must be paid. 

Group IX. claim(8) 20*23. drawn to an assay for methyttrsnsfemse activity. 
Group X, claim(s) 32 and 33. drawn to an antibiotic screening assay. 
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